Abstract

In numerous applications data are observed at random times and an estimated graph of the spectral 
density may be relevant for characterizing and explaining phenomena. By using a wavelet analysis, 
one derives a nonparametric estimator of the spectral density of a Gaussian process with stationary 
increments (or a stationary Gaussian process) from the observation of one path at random discrete 
times. For every positive frequency, this estimator is proved to satisfy a central limit theorem with a 
convergence rate depending on the roughness of the process and the moment of random durations between 
successive observations. In the case of stationary Gaussian processes, one can compare this estimator 
with estimators based on the empirical periodogram. Both estimators reach the same optimal rate of 
convergence, but the estimator based on wavelet analysis converges for a different class of random times. 
Simulation examples and application to biological data are also provided. 

Keywords: continuous wavelet transform; fractional Brownian motion; Gaussian processes observed at 
random times; heartbeat series; multiscale fractional Brownian motion; nonparametric estimation; spectral 
density. 



1 Introduction 

In biology, finance, internet traffic, oceanography, civil engineering, etc., detrended data are often modeled 
by centered Gaussian processes observed at random times. Under some stationarity assumption (or by 
assuming the stationarity of increments), such processes are characterized by their spectral density function. 
Roughly speaking, the spectral density function corresponds to the Fourier transform of the covariance of
the process (or its increments), and in the sequel it will be denoted by $f(\xi)$, where $\xi$ is the frequency.

Before going further, let us give a detailed example: heart rate variability. Cardiologists are interested
in the behavior of its spectral density, usually on the two frequency bands $(\omega_1, \omega_2) = (0.04\,\mathrm{Hz}, 0.15\,\mathrm{Hz})$
and $(\omega_2, \omega_3) = (0.15\,\mathrm{Hz}, 0.5\,\mathrm{Hz})$, corresponding to the orthosympathetic nervous system and the
parasympathetic one, respectively; see Task Force of the European Society of Cardiology and the North American Society
of Pacing and Electrophysiology (1996). The spectral density follows different power laws on the different
frequency bands, i.e. $f(\xi) = \sigma_j |\xi|^{-\beta_j}$ when $|\xi| \in (\omega_j, \omega_{j+1})$. Finally, according to the type of activity or the
period of the day, we notice variations of these parameters, see Section 4.3.

As shown by the previous example, the spectral density contains relevant information. The phenomenon 
of interest is known through the observation of a Gaussian stationary process at random times. From a 
statistical point of view, there are two different situations regarding estimation of the spectral density. On 
the one hand, for Gaussian processes with stationary increments (which include fractal processes), most of 
the statistical studies concern the behavior of the spectral density when $|\xi| \to \infty$ or in the neighborhood of
0, with a regularly spaced sampling scheme, see Dahlhaus (1989), Gloter and Hoffmann (2007), Moulines et 
al. (2007), Begyn (2005) or the book edited by Doukhan et al. (2003). On the other hand, the estimation 
of the spectral density of stationary Gaussian processes is a classical problem, which has been studied 
by using periodogram methods, see Shapiro and Silverman (1960), Parzen (1983), Masry (1978a-b) or Lii 
and Masry (1994). A precise condition to avoid aliasing has been given in Masry (1978a-b), see Section
2. Unfortunately, Masry's conditions are not satisfied in the heartbeat time series example. In this case,
the durations between two observation times are bounded since they correspond to the time between two
successive heartbeats. Thus, we have introduced a different modelling of observation times, see Section 2.

Moreover, we prefer to use wavelet analysis rather than empirical periodograms. This approach was 
introduced for processes with stationary increments (in the particular case of fractional Brownian motion) 
by Flandrin (1992), see also Abry et al. (2003). From wavelet analysis, we derive a nonparametric estimator 
of the spectral density. Then this estimator is proved to satisfy a central limit theorem (CLT in the sequel). 
Numerical applications show the good accuracy of this estimator in the case of processes having stationary 
increments, e.g. fractional Brownian motion, as well as in the case of stationary processes, e.g. the
Ornstein-Uhlenbeck process.

The rest of the paper is organized as follows: Section 2 gives a description of the problem. Section 3 is
devoted to wavelet analysis and the CLT satisfied by the estimator of the spectral density $f$. This estimator
is applied to generated and real data in Section 4. The Appendix contains the proofs.

2 Description of the problem 

Consider first a Gaussian process $X = \{X(t), t \in \mathbb{R}\}$ with zero mean and stationary increments. The results
can however be extended to the case where a polynomial trend is added to such processes. The process $X$
can be written following a harmonizable representation, see Yaglom (1958) or Cramér & Leadbetter (1967).
We adopt a more recent notation as in Bonami & Estrade (2003), thus

$$X(t) = \int_{\mathbb{R}} \left(e^{it\xi} - 1\right) f^{1/2}(\xi)\, dW(\xi) \quad \text{for all } t \in \mathbb{R}, \tag{1}$$

where

• the complex isotropic random measure $dW$ satisfies $dW = dW_1 + i\, dW_2$ with $dW_1$ and $dW_2$ two independent
real-valued Brownian measures (see more details in Section 7.2.2 of Samorodnitsky
& Taqqu (1994)). Therefore, when $g = g_1 + i g_2$ where $g_1$ and $g_2$ are respectively even and odd real-valued
functions such that $\int_{\mathbb{R}} g_i^2(x)\, dx < \infty$ ($i = 1, 2$), then $\mathbb{E}\left[\left|\int_{\mathbb{R}} g(\xi)\, dW(\xi)\right|^2\right] = \int_{\mathbb{R}} |g(x)|^2\, dx < \infty$.
Moreover, if $h = h_1 + i h_2$ where $h_1$ and $h_2$ are also respectively even and odd real-valued functions
such that $\int_{\mathbb{R}} h_i^2(x)\, dx < \infty$ ($i = 1, 2$), then

$$\mathbb{E}\left[\left(\int_{\mathbb{R}} g(\xi)\, dW(\xi)\right) \overline{\left(\int_{\mathbb{R}} h(\xi)\, dW(\xi)\right)}\right] = \int_{\mathbb{R}} g(x)\, \overline{h(x)}\, dx. \tag{2}$$

• the function $f$ is called the spectral density of $X$ and is a non-negative even function such that

$$\int_{\mathbb{R}} \left(1 \wedge |\xi|^2\right) f(\xi)\, d\xi < \infty. \tag{3}$$

In the sequel, $f$ will also be supposed to satisfy Assumption F(H) defined in Subsection 3.2, but
the conditions are weak and the class of processes that can be considered is general.

As a particular case, if $X$ is a stationary process, one still denotes by $f$ the spectral density such that

$$X(t) = \int_{\mathbb{R}} e^{it\xi} f^{1/2}(\xi)\, dW(\xi) \quad \text{for all } t \in \mathbb{R}, \tag{4}$$

where $f$ is still a Borel positive even function, but satisfying the stronger condition

$$\int_{\mathbb{R}} f(\xi)\, d\xi < \infty. \tag{5}$$

Even if the definitions are different, $f$ denotes the spectral density both of a process having stationary
increments and of a stationary process. Indeed, in the sequel we consider wavelet coefficients of $X$, which have
the same expression with respect to $f$ for both models (1) and (4); see more details in Proposition 1. Define
also the $\sigma$-algebra $\mathcal{F}_X$ generated by the process $X$, i.e.

$$\mathcal{F}_X := \sigma\{X(t),\ t \in \mathbb{R}\}. \tag{6}$$

A path of such a process $X$ on the interval $[0, T_n]$ at discrete times $t_i^{(n)}$ for $i = 0, 1, \dots, n$ is observed, i.e.

$\left(X(t_0^{(n)}), X(t_1^{(n)}), \dots, X(t_n^{(n)})\right)$ is known, with $0 = t_0^{(n)} < t_1^{(n)} < \dots < t_n^{(n)} = T_n$.

A unified frame of irregular observation times, grouping deterministic and stochastic ones, will be considered.
First, let us assume that there exist a sequence of positive real numbers $(\delta_n)_{n \in \mathbb{N}}$ and a sequence of random
variables (r.v. in the sequel) $(L_k)_{k \in \mathbb{N}}$ (which could be deterministic real numbers) such that

$$\forall k \in \{0, 1, \dots, n-1\}, \quad t_{k+1}^{(n)} - t_k^{(n)} := \delta_n L_k, \quad \text{and} \quad \delta_n \underset{n \to \infty}{\longrightarrow} 0. \tag{7}$$

For $Z$ a r.v. and $\alpha \in (0, \infty)$, denote $\|Z\|_\alpha := \left(\mathbb{E}|Z|^\alpha\right)^{1/\alpha}$ if $\mathbb{E}|Z|^\alpha < \infty$. Now, assume that there exists
$s \in [1, \infty)$ such that

Assumption S(s): $(L_k)_{k \in \mathbb{N}}$ is a sequence of positive r.v. such that there exist $0 < m_1 \le M_1$ and $M_s < \infty$
satisfying

$$m_1 \le \mathbb{E} L_k = \|L_k\|_1 \le M_1 \quad \text{and} \quad \|L_k\|_s \le M_s, \quad \text{for all } k \in \mathbb{N}.$$

Then we can also define:

Assumption S($\infty$): $(L_k)_{k \in \mathbb{N}}$ is a sequence of positive r.v. satisfying Assumption S(s) for every $s \in \mathbb{N}$.

For instance, it is clear that if $(L_k)_{k \in \mathbb{N}}$ is a sequence of exponential or bounded r.v., then Assumption
S($\infty$) is satisfied. Now, $T_n = \delta_n (L_0 + \dots + L_{n-1})$ and under Assumption S(s) for any $s \ge 1$,

$$m_1 \times (n \delta_n) \le \mathbb{E}(T_n) \le M_1 \times (n \delta_n). \tag{8}$$

This point will be extensively used in the sequel to replace the asymptotic $\mathbb{E}(T_n) \to \infty$ by $n \delta_n \to \infty$.
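The sampling scheme of this section can be simulated in a few lines. The following is only a sketch under stated assumptions: the exponential durations, the mesh $\delta_n = n^{-0.6}$, the sample size and the function names are illustrative choices, not taken from the paper. It builds the observation times from the durations $\delta_n L_k$ and compares $T_n$ with $n\delta_n$.

```python
import random

def observation_times(n, delta_n, draw_duration, rng):
    """Return [t_0, ..., t_n] with t_0 = 0 and t_{k+1} - t_k = delta_n * L_k."""
    times = [0.0]
    for _ in range(n):
        times.append(times[-1] + delta_n * draw_duration(rng))
    return times

def exponential_duration(rng):
    # Assumed law for the sketch: L_k ~ Exp(1), so E L_k = 1 and all moments are finite.
    return rng.expovariate(1.0)

rng = random.Random(0)
n = 10_000
delta_n = n ** -0.6            # assumed mesh: delta_n -> 0 while n * delta_n -> infinity
times = observation_times(n, delta_n, exponential_duration, rng)
T_n = times[-1]
ratio = T_n / (n * delta_n)    # should be close to E L_k = 1 when n is large
```

With durations of mean one, `ratio` stays close to 1, which is the practical content of replacing $\mathbb{E}(T_n)$ by $n\delta_n$ in the asymptotics.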

Comments on the modelling of observation times 

Assumption S(s) on the observation times may seem slightly unusual. This leads to the following comments: 

1. Generally, for processes observed at random times (see for instance Lii and Masry, 1994), the durations
between observation times $L_k = (t_{k+1} - t_k)$ are random variables not depending on the data length. In
a sense, the asymptotic behavior only concerns the length of observation time $T_n$ (which is necessary
to estimate the spectral density at low frequencies). But the lag between two successive random
observation times has to be very small sufficiently often to allow an estimation of the spectral
density at high frequencies. Hence, observation times have to satisfy a strong condition and are
typically of Poissonian type.

2. In our modelling there are two asymptotic behaviors: the length of the observation time $T_n$ converges
to infinity and the mesh $\delta_n$ converges to 0. The first one is standard, up to the slight difference that
$T_n$ can be random. This assumption is justified by numerous applications; for instance, the duration
of a marathon is clearly random. Thus we have to replace the first asymptotic by $\mathbb{E}(T_n) \to \infty$. The
second one is less standard but corresponds to applications. We have followed and transposed the
idea of round-off introduced, to the best of our knowledge, by Delattre and Jacod (1997) and still in use
today, see for example Robert and Rosenbaum (2008). In this setting, the time is continuous but
rounded off with a precision $\delta_n$. Then, the durations between observation times $(t_{k+1}^{(n)} - t_k^{(n)})$ are the
mesh $\delta_n$ multiplied by positive random variables $L_k$. Finally, we do not assume that the r.v. $L_k$
are independent or identically distributed.

3. Our choice which is also relative to numerous application cases (see the example of heart rate variability 
below) has been to provide a spectral density estimation under very weak conditions on the observation 
times. Typically our results remain valid even for regular sampling, which is not the case under Masry's 
conditions defined below. 

4. In applications, signals are observed at discrete times which are mostly irregularly spaced and random.
This type of observation is met in medicine, physics, mechanics, oceanography, etc. In these cases
observation times depend on the measuring instrument, and therefore on a randomness independent from that
of the process $X$. In this context, the hypothesis of independence of the durations $(L_k)_{k \in \mathbb{N}}$ and $\mathcal{F}_X$ is
completely realistic. The only case where this assumption seems restrictive concerns financial data.
However, to this day it is always made, see for instance Hayashi and Yoshida (2005) or Aït-Sahalia
and Mykland (2008).

Estimation of the spectral density, state of the art 

To our knowledge, the estimation of the spectral density of a Gaussian process with stationary increments
on finite bands of frequencies, from observation at discrete times, is a new problem. Recall that the
spectral density $f(\xi) = C\, |\xi|^{-(2H+1)}$ corresponds to a fractional Brownian motion (fBm in the sequel) with
Hurst index $H$. However, most of the statistical studies devoted to the fBm or its generalizations concern
estimation of the local regularity parameter (linked to the behavior of the spectral density at $\infty$) or the
long memory parameter (linked to the behavior of the spectral density in the neighborhood of 0). The
estimation of the spectral density of stationary Gaussian processes is a classical problem corresponding to
numerous practical applications, see Shapiro and Silverman (1960) or Parzen (1983). The methods used
are based on the periodogram defined by $I_T(\xi) = (2\pi T)^{-1} \left| \int_0^T e^{-it\xi} X(t)\, dt \right|^2$. However, if $(X_{t_1}, \dots, X_{t_N})$ is
known, with regularly spaced observation times $t_i = i\Delta$ and $T = t_N = N\Delta$, then $\lim_{T \to \infty} \mathbb{E} I_T(\xi) = f(\xi)$
but $\lim_{N \to \infty} \mathbb{E} J_N(\xi) = \sum_{k \in \mathbb{Z}} f(\xi + 2k\pi\Delta^{-1})$, where $J_N$ is the empirical periodogram, that is $J_N(\xi) :=
(2\pi N \Delta)^{-1} \left| \Delta \sum_{k=1}^{N} e^{-i\xi k\Delta} X(k\Delta) \right|^2$. Such a phenomenon is called aliasing. To avoid aliasing, random
sampling is chosen and then the empirical periodogram becomes asymptotically unbiased. By using a
spectral window, an estimator of the spectral density can be deduced and it satisfies a central limit theorem
(CLT) with a rate of convergence $T^{-2/5}$, see Masry (1978a-b) or Lii and Masry (1994). These results are
obtained for random sampling satisfying very specific conditions that we will call in the sequel:

Masry's conditions: the process of observation times $(t_k)_k$ is a stationary, orderly point process independent
of $X$, with known mean rate $\beta$ and covariance density $c(u)$, and satisfies the condition $\beta^2 + c(u) > 0$
a.e., where $N(\cdot)$ is the associated counting process, $\beta = \mathbb{E}[N((0, 1])]$ and $c(u)$ its covariance density
function (Masry, 1978, Cor. 1.1, p. 320).

When the trajectory is not sampled but observed at random times not chosen by the experimenter, a first
step before the estimation of the spectral density is to check that the family $(t_i)$ satisfies Masry's conditions,
and for this it is necessary to estimate the mean rate $\beta$ and the covariance density function $c(u)$.
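The aliasing phenomenon described above for regularly spaced observations can be seen directly on the empirical periodogram: on the grid $k\Delta$, the frequencies $\xi$ and $\xi + 2\pi/\Delta$ cannot be distinguished. The sketch below illustrates this on a toy deterministic signal; the signal, the step $\Delta = 0.1$ and the function names are assumptions made for the illustration only.

```python
import cmath
import math

def discrete_fourier_sum(values, delta, xi):
    """Delta * sum over k = 1..N of exp(-i * xi * k * Delta) * x(k * Delta)."""
    return delta * sum(cmath.exp(-1j * xi * k * delta) * x
                       for k, x in enumerate(values, start=1))

def empirical_periodogram(values, delta, xi):
    """Squared modulus of the discrete Fourier sum, normalized by 2 * pi * N * Delta."""
    n = len(values)
    return abs(discrete_fourier_sum(values, delta, xi)) ** 2 / (2 * math.pi * n * delta)

delta = 0.1
samples = [math.cos(3.0 * k * delta) for k in range(1, 201)]   # toy signal on the grid
xi = 3.0
alias = xi + 2 * math.pi / delta       # frequency shifted by one full period 2*pi/Delta
j_xi = empirical_periodogram(samples, delta, xi)
j_alias = empirical_periodogram(samples, delta, alias)
```

The two values `j_xi` and `j_alias` agree up to rounding error, so a regular grid cannot separate the contributions $f(\xi + 2k\pi\Delta^{-1})$, whatever the sample size.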

Wavelet based estimators 

Wavelet analysis was already used to estimate the parametric behavior of a power law spectral density
when $\log|\xi| \to \infty$ or $\log|\xi| \to -\infty$ in a time series (with regularly spaced observation times). In the sequel,
we will show that wavelet analysis is also an interesting tool to estimate the spectral density for Gaussian
processes having stationary increments (or stationary Gaussian processes) when a path is observed at
random times. Let us underline that the wavelet analysis in Abry et al. (2003) is based on the sample
variance of wavelet coefficients and thus is different from that proposed by Lehr and Lii (1997) or Goa et
al. (2002), who respectively consider the wavelet decomposition of the estimator derived from the empirical
periodogram and the periodogram of the Haar wavelet transform of the process. In both these last cases,
discrete time observations are supposed to satisfy Masry's conditions to avoid aliasing.

We consider a nonparametric estimator of the spectral density based on a sample variance of wavelet
coefficients. There are two main differences with the approach of Flandrin (1992) or Abry et al. (2003).
Firstly, the definition of "empirical" wavelet coefficients, see (11), is adapted to non-regular observation
times. Then a general CLT for the sample variance of such "empirical" wavelet coefficients is established (see
Theorem 1) and a CLT for a semiparametric estimator of the spectral density can be deduced for a large
class of fractional processes. Secondly, one considers a sequence of mother wavelets $\psi_\lambda$ in a way that
enables the convergence, as $\lambda \to \infty$, of $|\hat\psi_\lambda|^2$ to a Dirac mass concentrated at the frequency $\xi = 1$. Then a
CLT for a nonparametric estimator of the spectral density is derived (see Proposition 2). For observation
times satisfying Assumption S(s) with $s > 2$, the supremum of the convergence rate of this last CLT is
$T_n^{-2/5}$. This is the same convergence rate as for the periodogram based estimator (see for instance
Lii and Masry, 1994), but for a class of observation times clearly more general than Masry's (see
for example Lii and Masry, 1994). Indeed, our assumptions on observation times allow non-stationary or
regularly spaced times, for Gaussian stationary processes and also for Gaussian processes having stationary
increments (like fBm). However, a relation between $T_n$ and $\delta_n$ is required (see condition (16) below). This
condition depends on the regularity of the trajectory and the variability $s$ of observation times. Therefore,
in terms of the number $n$ of observations, the convergence rate of our estimator is slower than $n^{-2/5}$.

Finally, let us add two comments on the choice and the advantage of wavelet based estimators. Firstly, our 
method plainly uses the time- frequency localization of the wavelet: in frequency, to build a nonparametric 
estimator of the spectral density from continuous time observations, and in time, to bound the error of 
approximation of the wavelet coefficient with discrete time observations. The conditions required on mother
wavelets are mild and satisfied by a large set of wavelets (Daubechies wavelet $D_p$ for $p \ge 6$, Lemarié-Meyer,
Morlet, Gabor, biorthogonal wavelets, ...) and only exclude the Haar basis and Daubechies wavelets for $p \le 4$.
Actually, we do not need the family of functions generated by dilations and translations to form a basis
of $L^2(\mathbb{R})$. Secondly, our wavelet based estimator can be applied to stationary processes as well as processes
having stationary increments. Moreover, it is robust to possible polynomial trends. Such properties are
induced by the number of vanishing moments of the mother wavelet. A periodogram estimator does not
satisfy such conditions and therefore cannot be efficiently applied in so many cases.

3 Main results 

This section contains three main results. In the first subsection, we specify conditions on the mother wavelet, 
and give a representation formula for the wavelet coefficients of the process. In the second subsection, we 
establish a CLT satisfied by the sample variance of wavelet coefficients. This result provides the rate of
convergence of a spectral density estimator in parametric or semiparametric cases (for instance for a fBm).
Finally, the third subsection is devoted to a nonparametric estimation of the spectral density through
a localization procedure.

3.1 Definition and harmonizable representation of wavelet coefficients

Let $\psi : \mathbb{R} \to \mathbb{R}$ be a function, the so-called "mother" wavelet, and denote by $\hat f(\xi) = \int_{\mathbb{R}} e^{-i\xi x} f(x)\, dx$ the
Fourier transform of $f \in L^1(\mathbb{R}) \cap L^2(\mathbb{R})$. Let $(m, q, r) \in \mathbb{N}^* \times \mathbb{R}_+^2$, and consider the following set of assumptions
on $\psi$:

Assumption W(m,q,r): $\psi : \mathbb{R} \to \mathbb{R}$ is a differentiable function satisfying:

• Number of vanishing moments: for all $n \le m + 1$, $\int_{\mathbb{R}} |t^n \psi(t)|\, dt < \infty$, and

$$\int_{\mathbb{R}} t^n \psi(t)\, dt = 0 \quad \text{for all } n \le m.$$

• Time localization: there exists a constant $C_\psi > 0$ such that for all $t \in \mathbb{R}$,

$$(1 + |t|)^q\, |\psi(t)| \le C_\psi.$$

• Frequency localization: there exists a constant $C'_\psi > 0$ such that for all $\xi \in \mathbb{R}$,

$$(1 + |\xi|)^r \left( |\hat\psi(\xi)| + |\hat\psi'(\xi)| \right) \le C'_\psi.$$

The first condition of W(m,q,r) implies that $\hat\psi(\xi) = O(|\xi|^{m+1})$ when $\xi \to 0$ and that $\hat\psi$ is $(m + 1)$ times continuously
differentiable. In the sequel, we assume at most W(1, 3, 1/2). These conditions are mild and are satisfied
by many famous wavelets (Daubechies wavelet $D_p$ for $p \ge 6$, Lemarié-Meyer, Morlet, Gabor, biorthogonal
wavelets, ...). It is also not mandatory to choose $\psi$ to be a "mother" wavelet associated to a multiresolution
analysis of $L^2(\mathbb{R})$, and the whole theory can be developed without resorting to this assumption.

Let $(a, b) \in \mathbb{R}_+^* \times \mathbb{R}$, and define $d_X(a, b)$ to be the wavelet coefficient of the process $X$ for the scale $a$
and the shift $b$, such that

$$d_X(a, b) := \frac{1}{\sqrt{a}} \int_{\mathbb{R}} \psi\left(\frac{t - b}{a}\right) X(t)\, dt.$$

This family of wavelet coefficients satisfies the following property:

Proposition 1 (Harmonizable representation) Let $\psi$ satisfy Assumption W(1,1,0) and $X$ be a Gaussian
process defined by (1) or (4) with a spectral density $f$ satisfying respectively (3) or (5). Then,

$$d_X(a, b) = \sqrt{a} \int_{\mathbb{R}} e^{ib\xi}\, \overline{\hat\psi(a\xi)}\, f^{1/2}(\xi)\, dW(\xi) \quad \text{for all } (a, b) \in \mathbb{R}_+^* \times \mathbb{R}, \tag{9}$$

and, for $a > 0$, $(d_X(a, b))_{b \in \mathbb{R}}$ is a stationary centered Gaussian process with variance given by

$$\mathbb{E}\left(|d_X(a, b)|^2\right) = \mathcal{I}_1(a) := a \int_{\mathbb{R}} |\hat\psi(au)|^2 f(u)\, du \quad \text{for all } b \in \mathbb{R}. \tag{10}$$

The proof of this proposition is grouped with all the other proofs in the Appendix.
3.2 An estimator of the variance of wavelet coefficients and its application to the
semi-parametric estimation of the spectral density

Let us begin with an example. If $X$ is a fBm with Hurst parameter $H \in (0, 1)$, its spectral density
is $f(\xi) = C\, |\xi|^{-(2H+1)}$ for all $\xi \in \mathbb{R}^*$ (with $C > 0$). Then, for a scale $a > 0$, a straightforward
computation of the variance of wavelet coefficients $\mathcal{I}_1(a)$ defined in (10) shows that $\mathcal{I}_1(a) = K\, a^{2H+1}$ with
$K := C \int_{\mathbb{R}} \frac{|\hat\psi(u)|^2}{|u|^{2H+1}}\, du$. Therefore a consistent estimator of $\mathcal{I}_1(a)$ furnishes a consistent estimator of $H$
obtained by a log-log regression, i.e. a linear regression of the logarithms of the estimates of $\mathcal{I}_1(a_j)$, $1 \le j \le m$,
onto $(\log a_j)_{1 \le j \le m}$. The same method works also for
multiscale fBm (see Bardet and Bertrand, 2007b).
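The log-log regression mentioned in this example can be sketched as follows. This is an illustration only: the scales, the constant and the Hurst value are hypothetical, and the "variances" are exact power-law values rather than estimates, so the regression recovers the parameters without error. With estimated variances the same code would return noisy estimates.

```python
import math

def loglog_fit(scales, variances):
    """Least-squares slope and intercept of log(variances) against log(scales)."""
    xs = [math.log(a) for a in scales]
    ys = [math.log(v) for v in variances]
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar

def hurst_from_slope(slope):
    # A variance of the form K * a^(2H + 1) has log-log slope 2H + 1.
    return (slope - 1.0) / 2.0

true_h, k_const = 0.7, 2.5                    # hypothetical parameter values
scales = [0.5, 1.0, 2.0, 4.0, 8.0]            # hypothetical scales a_1, ..., a_m
variances = [k_const * a ** (2 * true_h + 1) for a in scales]
slope, intercept = loglog_fit(scales, variances)
h_hat = hurst_from_slope(slope)
k_hat = math.exp(intercept)
```

The slope gives $2H + 1$ and the exponential of the intercept gives the multiplicative constant, which is why one regression yields both parameters.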

Thus our first aim is the estimation of $\mathcal{I}_1(a)$. When only $(X(t_0^{(n)}), \dots, X(t_n^{(n)}))$ is known, an explicit
formula for $\mathcal{I}_1(a)$ is not available, for the two following reasons:

1. on the one hand, $d_X(a, b)$ is defined with a Lebesgue integral and cannot be directly computed from
data. As in Gloter and Hoffmann (2007), an approximation formula will be considered for computing
wavelet coefficients. Thus, for $(a, b) \in \mathbb{R}_+^* \times \mathbb{R}$ we define the empirical wavelet coefficient by

$$e_X(a, b) := \frac{1}{\sqrt{a}} \sum_{i=0}^{n-1} \left( \int_{t_i^{(n)}}^{t_{i+1}^{(n)}} \psi\left(\frac{t - b}{a}\right) dt \right) X(t_i^{(n)}). \tag{11}$$

2. on the other hand, only a sample mean of $|d_X(a, b)|^2$, instead of $\mathbb{E}|d_X(a, b)|^2$, is computable. Thus,
define the sample estimator of $\mathcal{I}_1(a)$ by

$$J_n(a) := \frac{1}{n} \sum_{k=0}^{n-1} |e_X(a, c_k)|^2, \tag{12}$$
fc=o 

where $(c_k)_k$ is a family of increasing real numbers (so-called shifts). In this paper, we will consider a
uniform repartition of shifts, i.e. for $k = 0, \dots, n$,

$$c_k = T_n^{\rho} + \frac{k}{n} \left( T_n - 2\, T_n^{\rho} \right) \quad \text{with } \rho \in (3/4, 1). \tag{13}$$

In this example $(c_k)_{1 \le k \le n}$ are random variables depending on the random times $(t_0^{(n)}, \dots, t_n^{(n)})$, but $c_{k+1} - c_k$
does not depend on $k$. We will see that it is not easy to consider a simpler expression of $(c_k)$; for instance
$c_k = k T_n / n$ could not be used because there would be some edge effects when estimating the wavelet coefficients
at $c_0$ or $c_n$. Therefore a sufficient "distance" from the boundaries 0 and $T_n$ is necessary. However, other
choices of $(c_k)_k$ are possible (for instance $c_k = t_k$), but we have not been able to find an optimal choice and
simulations do not show significant differences between these choices. Now additional conditions on $f$ have
to be considered:

Assumption F(H): $f$ is an even function, differentiable on $[0, \infty)$ except for a finite number $K$ of real
numbers $0 = \omega_0 < \omega_1 < \dots < \omega_K$, but $f$ admits left and right limits at each $\omega_k$, with a derivative $f'$ (defined on
all open intervals $(\omega_k, \omega_{k+1})$, with $\omega_{K+1} = \infty$ by convention) such that

$$\int_{\mathbb{R}} \left(1 \wedge |\xi|^3\right) |f'(\xi)|\, d\xi < \infty. \tag{14}$$

Moreover, there exist $C_0, C'_0 > 0$ and $H > 0$ such that for all $|x| \ge \omega_K$,

$$f(x) \le C_0\, |x|^{-(2H+1)} \quad \text{and} \quad |f'(x)| \le C'_0\, |x|^{-(2H+2)}. \tag{15}$$
Here are several examples of processes having a spectral density $f$ satisfying Assumption F(H):

Examples: 1. A smooth Gaussian process having stationary increments satisfies F(H) with $H > 1$.

2. A fractional Brownian motion with Hurst parameter $H \in (0, 1)$ satisfies F(H). Indeed, its spectral
density is given by $f(\xi) = C\, |\xi|^{-(2H+1)}$ and corresponds to a power law of the frequency.

3. However, fBm is a limited model. For instance, in some biological applications, statistical studies suggest
that the logarithm of the spectral density is a piecewise affine function of the log-frequency, see for instance
Collins and De Luca (1993) or Billat et al. (2009). Furthermore, in certain frequency bands the slope
corresponds to a Hurst parameter $H$ larger than 1. For these reasons, in Bardet and Bertrand (2007a), we
have introduced the multiscale fBm such that there exists a family of frequencies $\omega_1 < \dots < \omega_K$ satisfying
$f(\xi) = \sigma_i\, |\xi|^{-(2H_i+1)}$ for $|\xi| \in (\omega_i, \omega_{i+1})$ and $i = 0, \dots, K$, with the convention that $\omega_0 = 0$ and $\omega_{K+1} = \infty$,
$H_0 < 1$, $H_K > 0$ and $(\sigma_i, H_i) \in \mathbb{R}_+^* \times \mathbb{R}$ for $i = 1, \dots, K - 1$. Then Condition (3) and Assumption F(H)
are checked with $H = H_K$.

4. A stationary process with a bounded spectral density such as the Ornstein-Uhlenbeck process (for which
$f(\xi) := \alpha \left( \pi (\alpha^2 + \xi^2) \right)^{-1}$ with $\alpha > 0$).
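The sample variance of empirical wavelet coefficients introduced in this subsection (a sum of the observed values weighted by the integral of the dilated and shifted wavelet over each observation interval, followed by an average of squares over the shifts) can be sketched numerically. Everything specific below is an assumption made for the illustration: the wavelet $\psi(u) = (1 - u^2) e^{-u^2/2}$ (chosen because its antiderivative is explicit and its moments of order 0 and 1 vanish), the Brownian path, the exponential durations, the margin kept from both ends of the observation window and the number of shifts.

```python
import math
import random

def psi_primitive(u):
    """Antiderivative of the illustrative wavelet psi(u) = (1 - u^2) exp(-u^2 / 2)."""
    return u * math.exp(-0.5 * u * u)

def empirical_coefficient(times, values, a, b):
    """a^(-1/2) * sum_i X(t_i) * integral over [t_i, t_{i+1}] of psi((t - b) / a) dt."""
    total = 0.0
    for i in range(len(times) - 1):
        # Exact integral of psi((t - b) / a) over one observation interval.
        weight = a * (psi_primitive((times[i + 1] - b) / a)
                      - psi_primitive((times[i] - b) / a))
        total += weight * values[i]
    return total / math.sqrt(a)

def sample_variance(times, values, a, shifts):
    """Average of the squared empirical coefficients over the shifts."""
    return sum(empirical_coefficient(times, values, a, c) ** 2 for c in shifts) / len(shifts)

rng = random.Random(1)
n, delta_n = 2000, 0.05                      # hypothetical sample size and mesh
times, values = [0.0], [0.0]
for _ in range(n):
    step = delta_n * rng.expovariate(1.0)    # durations delta_n * L_k with L_k ~ Exp(1)
    times.append(times[-1] + step)
    # Brownian motion (fBm with H = 1/2) sampled exactly at the random times.
    values.append(values[-1] + math.sqrt(step) * rng.gauss(0.0, 1.0))

t_end = times[-1]
margin = t_end ** 0.8                        # assumed distance kept from both ends
n_shifts = 200
shifts = [margin + k * (t_end - 2 * margin) / n_shifts for k in range(n_shifts)]
j_small = sample_variance(times, values, 1.0, shifts)
j_large = sample_variance(times, values, 2.0, shifts)
```

For a Brownian path the ratio `j_large / j_small` fluctuates around $2^{2H+1} = 4$, with substantial noise on such a short path. A constant signal gives coefficients that vanish up to rounding, which reflects the vanishing zeroth moment of the wavelet.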

The sample variance of wavelet coefficients $J_n(a)$ computed from the observed trajectory $(X(t_0^{(n)}), \dots, X(t_n^{(n)}))$
and defined by (12) satisfies the following CLT:

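Theorem 1 Let $X$ be a Gaussian process defined by (1) or (4) with a spectral density $f$ satisfying respectively (3) or (5) and
Assumption F(H), $\psi$ satisfying Assumption W(1,3,1) and $(c_k)_k$ defined by (13). If Assumption S(s) holds
with $s > 2 + \frac{(1 - 3H)_+}{2H}$, and if

$$\mathbb{E}(T_n) \times \delta_n^{\left( \frac{(s-1)(2H \wedge 1)}{1 + (2H \wedge 1)} \right) \wedge \left( \frac{(s-1)(1 + (H \wedge 1))}{s + (H \wedge 1)} \right)} \underset{n \to \infty}{\longrightarrow} 0, \tag{16}$$

then for all $a > 0$,

$$\sqrt{\mathbb{E}(T_n)}\, \big( J_n(a) - \mathcal{I}_1(a) \big) \overset{\mathcal{D}}{\underset{n \to \infty}{\longrightarrow}} \mathcal{N}\left( 0,\ 4\pi a^2 \int_{\mathbb{R}} |\hat\psi(az)|^4 f^2(z)\, dz \right). \tag{17}$$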

Remark 1 1. The convergence rate of the CLT (17) is $\sqrt{\mathbb{E}(T_n)}$ when Condition (16) is satisfied. A
natural question is: what happens otherwise? This leads to the following comments: roughly speaking,
from Theorem 1 and an auxiliary lemma, one can deduce

$$J_n(a) = \mathcal{I}_1(a) + \left[ \mathbb{E}(T_n) \right]^{-1/2} \Gamma\, U + \zeta_n, \tag{18}$$

where $U \sim \mathcal{N}(0, 1)$, $\Gamma^2 = 4\pi a^2 \int_{\mathbb{R}} |\hat\psi(az)|^4 f^2(z)\, dz$ corresponds to the variance in CLT (17) and $\zeta_n$
corresponds to the discretization error. As soon as Condition (16) is fulfilled, the discretization term
$\zeta_n$ is negligible with respect to the CLT term and the rate of convergence is $\sqrt{\mathbb{E}(T_n)}$. If Condition
(16) is not satisfied, then the upper bound of the mean square error no longer decreases when
$\mathbb{E}(T_n) \to \infty$.

2. It is also possible to specify Condition (16) by using a relation between $\delta_n$ and $n$; for this, let

$$\delta_n = C\, n^{-d} \quad \text{with } 0 < d < 1.$$

The following Table 1 summarizes the possible choices of $s$ and $d$ and the supremum of the convergence
rate of the CLT (17) for several cases.





H known 

H>Q 


H unknown 
> 


H unknown 
H > 1/3 


H unknown 
H > 1 


Condition on s 




s = oo 


s > 2 


s > 2 


Condition on d 


, ^ I 1+(2HA1) N „ / s+(H Al) N 


d>\ 






ll + s(2HAl)/ " \s(2+(H/\l))-l) 






Supremum of the rate of 
convergence of CLT (ITTl) 


fors>2 + [l-3i?]_^ 

1 l + (ffAl) 

n 2 2+(ffAi) for s = oo 


1 

n 4 


n~i for s = 4 
n^To for s = oo 


n^i for s = 3 
for s = oo 


Supremum of the rate of 
convergence of CLT (1^ 


for s > 2 + 2ij. [l - 3i/] ^ 

2 l + (ffAl) 

„ 5 2+(ffAi) for s = OO 


1 

n 5 


n^s for s = A 
n^w, for s = oo 


n^'s for s = 3 
n^^> for s = oo 



Table 1: Conditions on $s$, $d$ and supremum of the convergence rate of the semiparametric estimator of
the spectral density $f$ (CLT (17)) and of the nonparametric estimator of $f$ (CLT (22)) with $\lambda_n \sim C\, T_n^{1/5+\kappa} \sim
C\, n^{(1-d)/5+\kappa}$ (and $\kappa > 0$ arbitrarily small), following the a priori on $H$.

Note that CLT (17) can be applied to the estimation of each $H_i$ of a multiscale fractional Brownian
motion when a trajectory is observed at random times. Indeed, in such a case and if $\psi$ is chosen such that
$\hat\psi(\xi) \ne 0$ only for $\xi \in [-\beta, -\alpha] \cup [\alpha, \beta]$, then (see details in Bardet and Bertrand, 2007b):

$$\mathcal{I}_1(a) = a^{2H_i+1}\, C_{f,\psi} \quad \text{for all } a \in [\beta/\omega_{i+1},\ \alpha/\omega_i],$$

with $C_{f,\psi} > 0$ not depending on $a$. Therefore a log-log regression of $J_n(a)$ onto $a$ for several values of
$a \in [\beta/\omega_{i+1},\ \alpha/\omega_i]$ provides an estimator of $H_i$ and $\sigma_i$ which follows a CLT with the same convergence rate
as (17). Such a result may of course also be applied to fBm without specifications of the scales $a$. This is
more precisely stated in the following

Corollary 1 (parametric case) Let $X$ be a fBm with parameters $H \in (0, 1)$ and $C > 0$. Assume that
$(X(t_0^{(n)}), \dots, X(t_n^{(n)}))$ is observed, that Assumption S($\infty$) is fulfilled, that $\psi$ satisfies Assumption W(1,3,1) and
that $\delta_n\, \mathbb{E} T_n \to 0$. Then there exists a constant $c > 0$ such that for $n$ large enough,

$$\mathbb{E} \left\| \begin{pmatrix} \widehat{H}_n \\ \widehat{C}_n \end{pmatrix} - \begin{pmatrix} H \\ C \end{pmatrix} \right\|^2 \le \frac{c}{\mathbb{E} T_n}, \tag{19}$$

where $\widehat{H}_n$ and $\widehat{C}_n$ are the estimators obtained by log-log regression of $J_n(a)$ onto $a$. If moreover the Hurst
index $H$ is known in advance to lie in the interval $(1/3, 1)$, then Condition S($\infty$) can be replaced by Condition
S($2 + \varepsilon$) for any $\varepsilon > 0$.

To our knowledge, only Begyn (2005) provides an asymptotic result on the estimation of $H$ under irregular
observation times, but only in the case of fBm and with a stronger condition than Assumption S($\infty$).

3.3 A nonparametric estimator of the spectral density 

The third result of this paper deals with the pointwise estimation of $f$ through a localization procedure in
Theorem 1. Let us define the "rescaled" functions:

$$\psi_\lambda(t) := \frac{1}{\sqrt{\lambda}}\, e^{it}\, \psi\left(\frac{t}{\lambda}\right) \tag{20}$$

in a way that enables the convergence, as $\lambda \to \infty$, of $|\hat\psi_\lambda|^2$ to a Dirac mass concentrated at the frequency
$\xi = 1$. Then a rescaled version of the estimator (11)-(12) is introduced:



k=0 i=0 



- ^ ^ E I E ^(4"') m - c.)) . (21) 

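From (20), it is obvious that

$$\hat\psi_\lambda(\xi) = \sqrt{\lambda}\, \hat\psi\big(\lambda(\xi - 1)\big) \quad \forall \xi \in \mathbb{R},$$

and after that

$$\mathcal{I}_\lambda(a) := \int_{\mathbb{R}} |\hat\psi_\lambda(u)|^2 f(u/a)\, du \longrightarrow f(1/a)\, \|\hat\psi\|^2_{L^2(\mathbb{R})} \quad \text{when } \lambda \to \infty$$

under weak conditions. Then a CLT is established for the nonparametric estimator (21) with a sequence
$(\lambda_n)_n$ satisfying $\lambda_n \to \infty$ and under the assumptions of Theorem 1. Note that the first condition, $\psi_\lambda$ satisfying
W(1,3,1), is fulfilled as soon as $\lambda_n > \Lambda$ when $\hat\psi$ is compactly supported in $[-\Lambda, \Lambda]$. Now, by using an
appropriate choice of a sequence $(\psi_{\lambda_n})$, one obtains: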

Proposition 2 Assume that the assumptions of Theorem 1 hold. If the spectral density $f$ is a twice continuously
differentiable function on $\mathbb{R}^*$, if $\hat\psi$ is compactly supported, and if the sequence $(\lambda_n)_n$ is such that

— ^ — > and — ^ — > oo, then for all > 0, 

flOfi n^oo riOfi n—>oo 




n — ^oo ^ C 







^ du \ 




Tp{u)\^ du)^ ^ 



(22) 



The rate of convergence of the parametric (or semiparametric) estimator is $T_n^{-1/2}$, see CLT (17). In the
case of a nonparametric estimator, using the optimal choice of $\lambda_n$, i.e. $\lambda_n = C (n \delta_n)^{1/5+\kappa} = O(T_n^{1/5+\kappa})$
with $\kappa > 0$ arbitrarily small, the supremum of the convergence rate of the estimator is $T_n^{-2/5}$. This is
the same rate of convergence as for the periodogram of a stationary process observed in continuous time
(Parzen, 1983) or observed at random times satisfying Masry's conditions (Lii and Masry, 1994).
However, in this last case, $T_n \sim C n$ a.s. when $n \to \infty$. Our result, that is CLT (22), is clearly more
general: it concerns processes having stationary increments and satisfying weak conditions on the random
observation times. But the price to pay for obtaining the convergence rate $T_n^{-2/5}$ is that $T_n \sim C n^{1-d}$ with

$$d > \left( \frac{1 + (2H \wedge 1)}{1 + s (2H \wedge 1)} \right) \vee \left( \frac{s + (H \wedge 1)}{s (2 + (H \wedge 1)) - 1} \right),$$

for instance $T_n \sim C n^{1/2}$ for $s = 3$ and $H > 1$, or for $s = \infty$
and $H > 0$; more details are provided in Table 1.
How can one explain that this convergence rate depends on $s$ and $H$? On the one hand, the smaller $H$, the
more irregular the trajectory of $X$ when $X$ is a process having stationary increments (the Hölder parameter
of a trajectory of $X$ is then $\alpha$ for all $\alpha < H$). Empirical wavelet coefficients, defined almost
as Riemann sums, approximate a smooth path better than an irregular path, and this explains why the
smaller $H$, the smaller the convergence rate of CLT (22). In the case of stationary processes, $H > 1$
and the convergence rate does not depend on $H$. On the other hand, the smaller $s$, the more variable
the observation times. Then for each frequency there are not enough successive data with an appropriate lag
to estimate correctly the spectral density around this frequency. Hence the smaller $s$, the smaller
the convergence rate of CLT (22).

Moreover, under W(m, 3, 1), $\int_{\mathbb{R}} t^n \psi(t)\, dt = 0$ for all $n \le m$ and all wavelet coefficients of any polynomial
function with degree less than or equal to $m$ vanish. Therefore, the estimator is robust, since:

Corollary 2 Under Assumption W(m, 3, 1) with $m \in \mathbb{N}^*$, Proposition 2 holds when a polynomial trend
with degree less than or equal to $m$ is added to $X$.

4 Numerical experiments 

For the numerical applications, the following choices have been made:

1. $\psi$ is chosen such that $\hat\psi(\xi) = \exp\big(-(|\xi|\,(5-|\xi|))^{-1}\big)\,1_{|\xi|<5}(\xi)$, which satisfies Assumption W(m,r)
for any (m, r) (and $\hat\psi(\xi) = 0$ for $|\xi| \ge 5$).

2. 6n = n~^'^ for insuring the convergence of fn'"\£,) for H > and s > 3. 

3. $\lambda_n = n^{d'}$ with $1/6 < d' < 1/2$. The admissibility condition on the wavelets $(\psi_{\lambda_n})$ requires that $n^{d'} > \Lambda = 5$,
and after numerous simulations, we have chosen $d' = \log(15)/\log(n)$.
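
The choice $d' = \log(15)/\log(n)$ can be checked numerically with the short sketch below. It only evaluates the two formulas quoted in item 3; the function names `chosen_exponent` and `bandwidth` are illustrative and do not come from the paper. The sketch suggests that this choice amounts to a constant bandwidth equal to 15, and that $d'$ stays inside $(1/6, 1/2)$ for the sample sizes tried here.

```python
import math


def chosen_exponent(n: int) -> float:
    """Exponent d' = log(15) / log(n) quoted in item 3 (n > 1)."""
    return math.log(15) / math.log(n)


def bandwidth(n: int, d_prime: float) -> float:
    """Bandwidth lambda_n = n ** d'."""
    return n ** d_prime


if __name__ == "__main__":
    for n in (1000, 5000, 50000):
        d = chosen_exponent(n)
        # Print n, the exponent, the resulting bandwidth, and whether
        # the exponent lies in the admissible range (1/6, 1/2).
        print(n, d, bandwidth(n, d), 1 / 6 < d < 1 / 2)
```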

4.1 Estimation of the spectral density of a fractional Brownian motion observed at 
random times 

For a standard ($\mathbb{E}X^2(1) = 1$) fBm with Hurst parameter $H$, $f(\xi) = C(H)\,|\xi|^{-2H-1}$, with $C(H) =
(H\,\Gamma(2H)\sin(\pi H))/\pi$. Three different kinds of independent and identically distributed random times are
considered: 

(T1): non-random uniform sampling, such that $L_k = 1$ for all $k \in \mathbb{N}^*$;
(T2): exponential random times, such that $\mathbb{E}L_k = 1$ for all $k \in \mathbb{N}^*$;

(T3): random times such that for all $k \in \mathbb{N}^*$, the cumulative distribution function of $L_k$ is $F_{L_k}(x) =
(1 - x^{-4})\,1_{x>1}$, implying $\mathbb{E}L_k^p < \infty$ for all $p < 4$ and $\mathbb{E}L_k^4 = \infty$. In this case Assumption S(s) is
satisfied if and only if $s < 4$.
(T4): random times such that for all $k \in \mathbb{N}^*$, the cumulative distribution function of $L_k$ is $F_{L_k}(x) =
(1 - x^{-2})\,1_{x>1}$, implying $\mathbb{E}L_k^p < \infty$ for all $p < 2$ and $\mathbb{E}L_k^2 = \infty$. In this case Assumption S(s) is
satisfied if and only if $s < 2$. As a consequence, this case does not satisfy the hypotheses of Theorem [1].
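
The four sampling schemes can be simulated by inverse-transform sampling, as in the following minimal sketch. It assumes that the observation times are built as $t_{k+1} - t_k = \delta_n L_k$ with $t_0 = 0$ (the starting point is an assumption), and the function names `sample_durations` and `observation_times` are hypothetical. For (T3) and (T4), if $U$ is uniform on $(0,1]$ then $U^{-1/\alpha}$ has distribution function $(1 - x^{-\alpha})1_{x>1}$.

```python
import random
from itertools import accumulate


def sample_durations(kind, n, rng):
    """Draw n i.i.d. durations L_k for the schemes T1 to T4."""
    if kind == "T1":
        # Non-random uniform sampling: L_k = 1.
        return [1.0] * n
    if kind == "T2":
        # Exponential durations with mean 1.
        return [rng.expovariate(1.0) for _ in range(n)]
    if kind in ("T3", "T4"):
        # Pareto-type law with F(x) = 1 - x**(-alpha) for x > 1.
        alpha = 4.0 if kind == "T3" else 2.0
        # 1 - rng.random() lies in (0, 1], so the power is well defined.
        return [(1.0 - rng.random()) ** (-1.0 / alpha) for _ in range(n)]
    raise ValueError("unknown sampling scheme: %r" % (kind,))


def observation_times(durations, delta_n):
    """Cumulate the durations: t_0 = 0 and t_{k+1} = t_k + delta_n * L_k."""
    return [0.0] + [delta_n * s for s in accumulate(durations)]


if __name__ == "__main__":
    rng = random.Random(0)
    durations = sample_durations("T2", 10, rng)
    print(observation_times(durations, delta_n=0.1))
```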

An example of such estimation of the spectral density for $H = 0.2$, $N = 50000$ and random times T2 is
presented in Figure 1. The results of the simulations are also provided in Table 2.

Fig. 1 An example of the estimation of the spectral density (left) and its logarithm (right) of a FBM
observed at exponential random times (T2) with confidence intervals (H = 0.2, N = 50000).

[Figure: left panel plotted against the chosen frequencies (Hz); right panel plotted against the logarithm of the frequencies.]

Comments on simulation results: 

1. The larger $N$, the more accurate the estimator of $f$, except for the case of random times T4 (a case
not included in the conditions of Proposition [2]);

2. The results are similar for T1 and T2, and a little less accurate for T3;

3. The smaller $H$, the more accurate the estimator of $f$.
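
For comparison with simulated estimates, the theoretical fBm spectral density used in this subsection can be evaluated as in the sketch below. It assumes the standard closed form $f(\xi) = C(H)|\xi|^{-2H-1}$ with $C(H) = H\Gamma(2H)\sin(\pi H)/\pi$; the function names are illustrative only.

```python
import math


def fbm_constant(hurst: float) -> float:
    """C(H) = H * Gamma(2H) * sin(pi * H) / pi."""
    return hurst * math.gamma(2.0 * hurst) * math.sin(math.pi * hurst) / math.pi


def fbm_spectral_density(xi: float, hurst: float) -> float:
    """Spectral density C(H) * |xi| ** (-2H - 1) of a standard fBm, xi != 0."""
    return fbm_constant(hurst) * abs(xi) ** (-2.0 * hurst - 1.0)


if __name__ == "__main__":
    # Theoretical curve for H = 0.2 on a few positive frequencies.
    for xi in (0.5, 1.0, 2.0, 5.0):
        print(xi, fbm_spectral_density(xi, 0.2))
```

For $H = 1/2$ the constant reduces to $1/(2\pi)$, which gives a quick sanity check of the formula against the Brownian motion case.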

4.2 Estimation of the spectral density of a stationary Ornstein-Uhlenbeck process 

Here, instead of FBM, which is a process having stationary increments, we consider a stationary Ornstein-Uhlenbeck
process, which is a stationary Gaussian process with covariance $r(t) := \exp(-a|t|)$ and therefore
with spectral density $f(\xi) := a\,(\pi(a^2 + \xi^2))^{-1}$. In such a case, since the spectral density is an analytic
function, there exists a more accurate nonparametric estimator (see for instance, Ibragimov, 2004). However,
to our knowledge, the case when paths are observed at random times is not considered in the literature.
The results of simulations are provided in Table 3.
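
The link between the Ornstein-Uhlenbeck covariance and its spectral density can be verified numerically. The sketch below assumes the convention $r(t) = \int_{\mathbb{R}} e^{it\xi} f(\xi)\,d\xi$ and the covariance $r(t) = \exp(-a|t|)$; the truncation level `xi_max` and the step are arbitrary illustrative choices, and the function names are hypothetical.

```python
import math


def ou_spectral_density(xi: float, a: float) -> float:
    """f(xi) = a / (pi * (a**2 + xi**2))."""
    return a / (math.pi * (a * a + xi * xi))


def covariance_from_density(t: float, a: float,
                            xi_max: float = 2000.0, step: float = 0.01) -> float:
    """Approximate r(t) = int cos(t * xi) f(xi) d(xi) over [-xi_max, xi_max].

    The integrand is even, so the trapezoidal rule is applied on
    [0, xi_max] and the result is doubled.
    """
    n = int(round(xi_max / step))
    total = 0.5 * (ou_spectral_density(0.0, a)
                   + math.cos(t * xi_max) * ou_spectral_density(xi_max, a))
    for k in range(1, n):
        xi = k * step
        total += math.cos(t * xi) * ou_spectral_density(xi, a)
    return 2.0 * step * total


if __name__ == "__main__":
    for t in (0.0, 0.5, 1.0):
        print(t, covariance_from_density(t, 1.0), math.exp(-abs(t)))
```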

Comments on simulation results:
1. The larger $N$, the more accurate the estimator of $f$ for all choices of random times;

2. The results are similar for T1, T2 and T3, and less accurate for T4;

3. Surprisingly, the case $a = 1$ is not clearly better than $a = 0.1$, despite the fact that the larger $a$, the
less correlated the process.

4.3 Estimation of the spectral density of heartbeat time series 

Heartbeats of several working people have been recorded during 24 hours (see an example in Fig. 2). These
data have kindly been provided by Professors Alain Chamoux and Gil Boudet (Faculty of Medicine,
Occupational Safety and Health, University of Auvergne, Clermont-Ferrand).



Fig. 2 An example of heart inter-beats during 24 h




Actually, heartbeat is measured at frequency 100 Hz. For physiological reasons, the duration between two
observations should be between 250 milliseconds and 2 seconds. In this framework, Assumption $S(\infty)$ would
obviously be satisfied. Assumption S(s) with $s < \infty$ holds for instance for other physiological signals like
EMG, EEG, etc.

Cardiologists are interested in the study of this signal in two frequency bands: the orthosympathetic 
and parasympathetic bands, i.e., the frequency bands (0.04 Hz, 0.15 Hz) and (0.15 Hz, 0.5 Hz) respectively. 
The definition of these bands is the outcome of research works, see e.g. Task Force of the European Society
of Cardiology and the North American Society of Pacing and Electrophysiology (1996), and is based on the 
fact that the behavior of the energy contained inside these bands would be a relevant indicator on the level 
of the stress of an individual. 

Indeed, for the heart rate, the parasympathetic system is often compared to a brake while the
orthosympathetic system acts as an accelerator; see e.g. Goldberger (2001). At rest there is a permanent
braking effect on the heart rate. Any solicitation of the cardiovascular system, any activity, initially
produces a reduction of the parasympathetic brake followed by a gradual involvement of the sympathetic system.
These mechanisms are very interesting to watch in many diseases, including heart failure but also rhythm
disorders that may fall under one or other of these two effects, and for monitoring the therapeutic effect of several
medicines, including some psychotropic drugs. In the field of physiology such data are crucial for measuring the
level of stress induced by physical activity or the level of perceived stress, which can be considered as a criterion
of overtraining in sport.

We decompose these data in 3 temporal zones following the activity: 

• Quiet activities ($t \in [1, 28000]$ in seconds);

• Intensive activities ($t \in [28000, 51400]$ in seconds);

• Sleep ($t \in [60000, 83400]$ in seconds).

Applying the spectral density estimator on those 3 sub-data sets and plotting its log-log representation for 
frequencies in [0.02, 1] Hz, we observe that: 

• in zone "Sleep" (see Figure 3), only one regression line could be computed for frequencies in [0.04, 0.5]
Hz, which is the usual spectral interval considered by specialists; in this zone $H \simeq 0.99$;

• in zone "Quiet activities" (respectively "Intensive activities") (see Figure 3), two regression lines could
be drawn for frequencies in [0.04, 0.5] Hz, distinguishing the orthosympathetic and the parasympathetic
spectral domains. Using an algorithm computing the "best" two regression lines (see for instance
Bardet and Bertrand, 2007b), one obtains $H \simeq 1.34$ (respectively $H \simeq 1.44$) in the orthosympathetic
domain, which is [0.04, 0.09] Hz (respectively [0.04, 0.11] Hz), and $H \simeq 0.89$ (respectively
$H \simeq 0.79$) in the parasympathetic domain, which is [0.09, 0.5] Hz (respectively [0.11, 0.5] Hz).
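
The way a value of $H$ can be read off a log-log regression line may be sketched as follows. This is only an illustration under the assumption that, on the frequency band considered, the spectral density behaves like $C\,\xi^{-(2H+1)}$, so that the slope of $\log f$ against $\log \xi$ equals $-(2H+1)$. It is not the two-line change-point algorithm cited above, and the function names are hypothetical.

```python
import math


def fit_loglog_line(freqs, densities):
    """Ordinary least squares fit of log(density) on log(frequency).

    Returns (slope, intercept).
    """
    xs = [math.log(f) for f in freqs]
    ys = [math.log(d) for d in densities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def hurst_from_slope(slope: float) -> float:
    """Invert slope = -(2H + 1)."""
    return -(slope + 1.0) / 2.0


if __name__ == "__main__":
    # Synthetic band [0.04, 0.5] Hz with an exact power law (H = 0.7).
    freqs = [0.04 + 0.01 * k for k in range(47)]
    densities = [3.0 * f ** (-2.0 * 0.7 - 1.0) for f in freqs]
    slope, _ = fit_loglog_line(freqs, densities)
    print(hurst_from_slope(slope))
```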

Fig. 3 Log-log representation of the spectral density estimator during "Sleep" zone (upper), "Quiet activities"
zone (middle) and "Intensive activities" zone (lower)

Conclusion

In this paper, we have constructed and studied an estimator of the spectral density $f$ of a Gaussian process
from discrete sampling at instants $(t_i^{(n)})$. One of the main novelties of this work is that the sampling
scheme is random: $t_{i+1}^{(n)} - t_i^{(n)} = \delta_n L_i$ for a sequence of r.v. $L_i$ and a sampling step $\delta_n \to 0$. Under a moment
condition on the r.v. $L_i$, we have obtained a CLT for the sample variance of wavelet coefficients. Then, by
using the same wavelet-type technique with a bandwidth $\lambda_n \to \infty$, we have obtained a pointwise estimator
of the spectral density at some frequency ^. Under the conditions: A„ = o((n(5„)^/^) and (n5nY^^ — o{^n), 
we have a CLT with a rate of convergence (n(5„)~^/^. The moment condition on r.v. Li is linked to the 
regularity index of the process X. This pointwise estimation of the spectral density is then applied to 
heartbeat time series. With this tool, we can observe variations of the spectral density according to the 
type of activity of human beings. 

Three directions for future research have been opened by this work. Firstly, our results could be extended
to non-Gaussian processes, i.e. when another measure replaces the Brownian measure in formula ([1]). Thus
the expression of the spectral density estimator will be the same in a non-Gaussian framework. However, certain
conditions will be necessary to establish CLTs similar to those obtained in Theorem [1] and Proposition [2].
Secondly, it could be interesting to investigate further how our sampling scheme compares with
Masry's conditions. Finally, from a practical point of view, we believe that our estimator has potential
applications for other kinds of real data.

Appendix

Proofs of useful lemmas and Proposition [1] 

In the sequel, the following lemma will be useful: 

Lemma 1 Let X be a Caussian process defined by ([T]) with a spectral density function f satisfying (0) or 
by ^ with a spectral density satisfying Then there exists a constant Co > such that 



$$|\mathbb{E}(X(t_1)X(t_2))| \le C_0\,(1+|t_1|)(1+|t_2|) \quad \text{for all } (t_1,t_2) \in \mathbb{R}^2. \tag{23}$$



Proof. Firstly, let us consider X defined by ([T]). For all t £ M, hy using ([2|), we have 



$$\mathbb{E}[X^2(t)] = \int_{\mathbb{R}} |e^{it\xi} - 1|^2 f(\xi)\,d\xi \le 2\int_0^1 |t\xi|^2 f(\xi)\,d\xi + 8\int_1^\infty f(\xi)\,d\xi.$$

This implies $\mathbb{E}(X^2(t)) \le C_0\,(1+|t|^2)$ where $C_0 = 4\int_{\mathbb{R}} (1 \wedge |\xi|^2)\, f(\xi)\,d\xi$. Then, by using the Cauchy-Schwarz
inequality, one deduces (23). Secondly, consider $X$ in the stationary case; then the isometry property and the integrability of $f$ imply that

$$|\mathbb{E}(X(t_1)X(t_2))| \le \int_{\mathbb{R}} f(\xi)\,d\xi < \infty.$$

Therefore (23) is satisfied with $C_0 = \int_{\mathbb{R}} f(\xi)\,d\xi$. This finishes the proof of Lemma [1]. ■
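
The proof of Lemma 1 rests on the elementary bound $|e^{ix} - 1|^2 \le \min(x^2, 4)$, which follows from $|e^{ix} - 1|^2 = 4\sin^2(x/2)$. The following sketch checks this bound on a grid; the grid and the names `increment_kernel` and `worst_violation` are arbitrary illustrative choices.

```python
import cmath


def increment_kernel(x: float) -> float:
    """|exp(i x) - 1| ** 2, which equals 4 * sin(x / 2) ** 2."""
    return abs(cmath.exp(1j * x) - 1.0) ** 2


# Largest value of |e^{ix} - 1|^2 - min(x^2, 4) over a grid on [-20, 20];
# it should not be positive beyond floating-point rounding.
grid = [k * 0.01 for k in range(-2000, 2001)]
worst_violation = max(increment_kernel(x) - min(x * x, 4.0) for x in grid)

if __name__ == "__main__":
    print(worst_violation)
```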

Proof, [of Proposition [T] We just give the proof for Gaussian processes defined by ([T]) with a spectral 
density function / satisfying The key property, which explains that the same representation formula 
holds for for Gaussian processes defined by ([T]) or is Formula (|24|) . Moreover, since Condition ^ implies 
Condition ([3]), all the convergence results remain valid under Condition 



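Let $X$ be defined by ([1]). Firstly, one can show that for all $a > 0$ and $b \in \mathbb{R}$, $\mathbb{E}|d_X(a,b)|^2 < \infty$. This
induces that $d_X(a,b)$ is well defined. Indeed, since $X$ is a real valued process, one has

$$\mathbb{E}|d_X(a,b)|^2 = \frac{1}{a}\int_{\mathbb{R}}\int_{\mathbb{R}} \psi\Big(\frac{t_1-b}{a}\Big)\,\overline{\psi}\Big(\frac{t_2-b}{a}\Big)\,\mathbb{E}\big(X(t_1)X(t_2)\big)\,dt_1\,dt_2$$

$$\le \frac{C_0}{a}\int_{\mathbb{R}}\int_{\mathbb{R}} \Big|\psi\Big(\frac{t_1-b}{a}\Big)\Big|\,\Big|\psi\Big(\frac{t_2-b}{a}\Big)\Big|\,(1+|t_1|)(1+|t_2|)\,dt_1\,dt_2 \le C_0\,a\,\Big(\int_{\mathbb{R}} |\psi(u)|\,(1+|b|+|au|)\,du\Big)^2 < \infty,$$

where we have used successively the bound (23), the change of variable $u = (t-b)/a$ and the second
condition of Assumption W(1, 1, 0). Next, one turns to the proof of the representation formula ([9]). Firstly,
recall that the stochastic integral of a complex valued function $g = g_1 + i g_2$ against a complex Gaussian measure
$W$ with real part $W_1$ and imaginary part $W_2$ is defined by

$$\int_{\mathbb{R}} g(x)\,dW(x) = \int_{\mathbb{R}} g_1(x)\,dW_1(x) - \int_{\mathbb{R}} g_2(x)\,dW_2(x)$$

and that $W_1$ and $W_2$ are Wiener measures, see [31, (7.2.8) p. 326]. Now, consider any interval $[\alpha, A] \subset\, ]0, \infty[$: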

't-b^ 



the function / is bounded on [a, A\ and 



M 



dt = a 



(u)\ du < 00. Therefore, one can 



M 



apply the Fubini-type Theorem for stochastic integral (see [201 Lemma 4.1, p. 116]) to the two integrals 
corresponding to the real and the imaginary part, then by summing up, we get 



M 



dt 



Since / ip{u) du = 0, for all couple (a, b) g]0, oo[xiR, we have 
Jm 



e^t^^l LJ ] dt = ae'^^iPiaO. 
M V « 



Thus, for all couple (a, 6) G]0,oo[xiR we have 



a r ^'^i^{aOf''\i)dW{i) = / vf^Mr(e'*«-l)-/'/'(e)dH^(0 



dt 



(24) 



(25) 
On the one hand, the first condition of W(1, 1, 0) and ([3]) imply that $\int_0^\infty |\hat\psi(a\xi)|^2 f(\xi)\,d\xi < \infty$. Therefore,
one can deduce that for any sequence of couples $(\alpha_n, A_n)$ converging to $(0, \infty)$, the sequence $\int_{\alpha_n}^{A_n} e^{ib\xi}\,\hat\psi(a\xi)\,f^{1/2}(\xi)\,dW(\xi)$
converges in $L^2(\Omega)$. On the other hand,



. t-b 



An 



1/2, 



dt 



converges also in L'^{Q), because 



IE 



, , t-b 







dt 



Indeed, firstly, by using Cauchy-Schwarz inequality, we get that 

2 







1 



1 f{i)di 



< 







1 



1/2 

/(O di ] X 

1/2 



2 




< OO 




(f 


e^*2« - 1 







2 \ V2 

f{i)di 



(rl poo / fl /•oo 

j^\ti^\^ f{i)di + A fiOdCj xi^J^M'fiOdC + ^J^ fiOd^ 

< ((4 + tl) JjlA ICP) . fiO X ((4 + tl) JjlA ■ fiO d^ 

<(2 + ti)x(2 + t2)x / (lAleP) -/(OdC- 



1/2 



Next, by using the isometry property we get the following upper bound 

2 



IE 



, .t-b 



e^'^-l]-rr^{OdW{0 



1/2/ 



dt 



MJm. \ a ) \ a 



< 



< oo, 



^ 1 A • /(O '^^'^ jj^jj (^) ^ (\^) (2 + *i) X (2 + ^2) dti dt2 

t-b- 



[i/\m-f{Od^ 



XI I (2 + t) 



dt 



where the last bound follows from Condition ^ and Condition W(l, 1, 0)). Eventually, one can pass to 
the limit in (j25p which provides 

't-b' 



/•oo 

a / e^'^ipiaOf/'iOdWiO 
Jo 



e''^-l]-f''\OdW{i) 



dt. 



But similar calculations would lead to the same result between the bounds — oo and 0. By adding the two 
integrals between and oo and between — oo and 0, one can obtain 

't-b' 



a / e''"'i^{ai)r\i)dW{0 



IR \ a 



M 



e''^-l)-f/HOdW{0 



dt 



dt 
which implies ([9]). Afterwards, formula ([9]) implies that for all $a > 0$, $b \in \mathbb{R}$, $d_X(a,b)$ is a centered Gaussian
random variable with variance $\mathcal{I}_1(a)$. Moreover, for all $a > 0$ and $(b_1, b_2) \in \mathbb{R}^2$ we have



IE [dx{a,bi)dx{a,b2, 



ale 



ia{bi-b2)^ 



Thus for a given $a > 0$, $\mathbb{E}(d_X(a,b_1)\,d_X(a,b_2))$ only depends on $(b_1 - b_2)$, which induces that $(d_X(a,b))_{b\in\mathbb{R}}$
is a stationary process. This finishes the proof of Proposition [1]. ■

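From formula ([9]), it is clear that for all $(a_1,a_2) \in (0,\infty)^2$ and for all $(b_1,b_2) \in \mathbb{R}^2$,

$$\mathbb{E}\big(d_X(a_1,b_1)\cdot \overline{d_X}(a_2,b_2)\big) = \sqrt{a_1 a_2}\cdot \gamma(b_2 - b_1, a_1, a_2)$$

where

$$\gamma(\theta, a_1, a_2) := \int_{\mathbb{R}} e^{i\theta\xi}\,\hat\psi(a_1\xi)\,\overline{\hat\psi}(a_2\xi)\,f(\xi)\,d\xi. \tag{26}$$

When $(a_1, a_2)$ are positive numbers, the function $\gamma$ and its first derivative with respect to $\theta$ can be bounded: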



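Lemma 2 Under Assumption W(1, 0, 1/2) and if $f$ satisfies ([3]) and Assumption F(H) with $H > 0$:

1. for all $(a_1,a_2) \in (0,\infty)^2$, there exists $C > 0$ not depending on $\theta$ such that $|\gamma(\theta,a_1,a_2)| \le C\,(1 \wedge |\theta|^{-1})$
for all $\theta \in \mathbb{R}$.

2. the function $\gamma$ is differentiable with respect to $\theta$ and for all $(a_1,a_2) \in (0,\infty)^2$, there exists $C > 0$ not
depending on $\theta$ such that $|\gamma'(\theta,a_1,a_2)| := \big|\frac{\partial\gamma}{\partial\theta}(\theta,a_1,a_2)\big| \le C\,(1 \wedge |\theta|^{-1})$ for all $\theta \in \mathbb{R}$.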



Proof. [of Lemma [2]] Firstly, from Assumption W(1, 0, 0) (induced by Assumption W(1, 0, 1/2)), there
exists a constant $c > 0$ such that

$$|\hat\psi(\xi)| \le c\,(1 \wedge |\xi|^2) \quad \text{for all } \xi \in \mathbb{R}. \tag{27}$$

Indeed, on the one hand, $|\hat\psi(\xi)| \le \|\psi\|_{L^1(\mathbb{R})} < \infty$. On the other hand, $\psi \in$ W(1, 0, 0) implies that $\hat\psi$ is twice
continuously differentiable and $\hat\psi(0) = \hat\psi'(0) = 0$. From the Taylor-Lagrange formula, for all $\xi \in \mathbb{R}^*$, there
exists $\xi_0$ with $|\xi_0| < |\xi|$ such that $\hat\psi(\xi) = \frac{\xi^2}{2}\,\hat\psi''(\xi_0)$. This induces $|\hat\psi(\xi)| \le \frac{1}{2}\,|\xi|^2 \times \int_{\mathbb{R}} t^2\,|\psi(t)|\,dt$,
providing the second bound of (27).
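Secondly, we show the first item. Inequality (27) implies that, for all $a > 0$,

$$\int_{\mathbb{R}} |\hat\psi(a\xi)|^2 f(\xi)\,d\xi \le c^2\int_{\mathbb{R}} (1 \wedge |a\xi|^4)\,f(\xi)\,d\xi \le c^2\,(1 \vee a^4)\int_{\mathbb{R}} (1 \wedge \xi^4)\,f(\xi)\,d\xi \le C,$$

with $C > 0$ not depending on $\theta$. From the Cauchy-Schwarz inequality,

$$|\gamma(\theta,a_1,a_2)| \le c^2\,(1 \vee a_1^2)(1 \vee a_2^2)\int_{\mathbb{R}} (1 \wedge \xi^4)\,f(\xi)\,d\xi.$$

Combined with the integrability condition on $f$, this means that $\gamma(\theta,a_1,a_2)$ is bounded by a constant. Moreover, with $f(\omega_k^+)$ and $f(\omega_k^-)$
denoting the right and left limits of $f$ at $\omega_k$, for all $1 \le k \le K-1$, $\theta \in \mathbb{R}^*$ and $(a_1,a_2) \in (0,\infty)^2$,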



— I e 



'/('^fc+i)V'(ai^fc+i)^(a2a;fc+i) - e'''^'^>= f{uj+)i){aiujk)i'{a2UJk 



/'(e)V^(«iOV^(«2e) + /(6(«i#(«iOV^(a2e) + a2V^(aie)^'(a20 



The same result remains in force for k = and k = K. Indeed, by using (j27p combined with Assumption 
F(i?), one deduces that for all 9 £ ]R and (01,02) G (0,oo)^, 



lime^^«/(e)V(aie)V'(a2e) = 
Moreover, since / is an even function. 



and 



hm e^^«/(e)V(aiOV'(«26 = 0. 



1 



/'(0^(ai6^(a20 + /(0(«iV^'(aiOV^(a2e) +a2V^(aiOV^'(a20 



Thus, by summing up and using Assumption F(if), for all G iR and (oi, 02) S (0, 00)^, 



K 



1 



^ (e*^-4(ai^fc)^(a2C^fc) - e-*^-'^-^(-aia;fc)V^(-a2a;fc)) (/(^+) - /(o;^)) 



fc=i 



/'(e)V^(aiO V^(a20 + /(O ( aiV^'(aiOV^(a20 + a2V^(aiO#(«2e) 



+- 

since the integral of the r.h.s. of the previous equality is well defined. Then, 

K 



/K-)| 



k=l 



+ 



[\f'{0kai0ka20\ + l/(OI(|ai||V^'(«iOV^(«26| + 1^21 \kaiOi^'{a20\) 



dC 



It remains to show the convergence of the previous integral. Using the same trick as in Formula (|27p . under 
Assumption W(1,0, 0), {ip'iOl < c' (l A |^|) with c' not depending on ^. So, for ah (01,02) G (0, 00)^ 



liR 



dC 



|/'(0^(aie)V'(a20| + l/(OI(|ail|V''(aie)V'(«20| + 1^21 |V'(aiO#(a20 1 

<Cia?a2 / 1/(^)1^4 ^2|/(Oe'|de + C2 / |/'(6I + (l«i I + |a2|)|/(OI 
J|5|<i J|5|>i 

<C [ \{lA\C\').\f'{0\ + {lA\^\').\f{0\\d^ 
where C > depends on c, c', ai and 02. But since (l A |^|^) < (l A and (l A < (l A then
from Assumption F{H), fj^ (l A • < 00 and /^j (l A • |/(^)| < 00 and this completes the 

proof of the first item. Eventually, one proves the second item. The differentiability is obvious and 



Jm 

Assumption W(l, 0,1/2) implies that for all a > 0, |a^|^''^|V^«)| < for all £ JR. Combined with 
()27p . this induces that for all a > and 6 £ M, 

\ip'{e,ai,a2)\ < [ |eilV^(aiOII^(«2e)l/(OdC 
Jm 



< c, 



with C > not depending on 6. Using the same arguments as above, for all 6 G M* and (01,02) € (0, 00) , 

K 



j'{d, oi, 02) = -7 X] (e*^'^''''^fcV'(ai'^fc)V'(a2'^fc) + 

k=l 



1 



e 
M 



(«i V''(aiO^(«20 + a2V'(aiOV''(a2e) 



and therefore |7'(6', oi, 02)! < t-tt, with C > not depending on 6. This finishes the proof. ■ 

Asymptotic behavior of sample variances of wavelet coefficients for continuous time 
processes 

Since $\mathcal{I}_1(a)$ is obviously defined from $|d_X(a,b)|^2$, we begin with the study of

$$I_n(a) := \frac{1}{n+1}\sum_{k=0}^{n} |d_X(a,c_k)|^2, \quad \text{for } a > 0 \text{ and } n \in \mathbb{N}^*. \tag{28}$$

For n G M* and a > 0, define also: 

,2 



(29) 



Note that depends on iE(cfc — q) and therefore its formula is valid when (c^) are r.v. However, we 
begin by proving the following proposition in the case of deterministic (c^) and of course -E(cfc — q) can be 
replaced by Cfc — Q in ([29]). 
Proposition 3 Let $X$ be a Gaussian process defined by ([1]) or by its stationary counterpart, with a spectral density $f$ satisfying the corresponding integrability condition, and $\psi$
satisfying Assumption W(1, 1, 1/2). Then if $(c_k)_k$ is a family of real numbers such that $c_1 < c_2 < \cdots < c_n$,
$n \max_{1\le k\le n}\{c_{k+1} - c_k\} \to \infty$ and there exists a constant $C'' > 0$ such that for all $n \in \mathbb{N}^*$

$$\max_{1\le k\le n}\{c_{k+1} - c_k\} \le C'' \min_{1\le k\le n}\{c_{k+1} - c_k\} < \infty,$$

then $\forall a > 0$,

$$\frac{I_n(a) - \mathcal{I}_1(a)}{S_n(a)} \xrightarrow[n\to\infty]{\mathcal{D}} \mathcal{N}(0,1). \tag{30}$$

Moreover, there exist $0 < C_m < C_M$ not depending on $n$ such that $\forall n \in \mathbb{N}^*$,

$$C_m \le S_n(a)\,\big(n \max_{1\le k\le n}\{c_{k+1} - c_k\}\big)^{1/2} \le C_M. \tag{31}$$

The proof of Proposition [3] relies on Lemma [2] and the following lemma, which is a Lindeberg CLT (see a
proof in Istas and Lang, 1997):

Lemma 3 Let $(Y_{N,i})_{1\le i\le N,\, N\in\mathbb{N}^*}$ be a triangular array of zero-mean Gaussian r.v. Define $S_N^2 := \mathrm{var}(V_N)$
with $V_N := \sum_{i=1}^{N} Y_{N,i}^2$ and $\beta_N := \max_{1\le i\le N} \sum_{j=1}^{N} |\mathrm{cov}(Y_{N,i}, Y_{N,j})|$. If $\lim_{N\to\infty} \beta_N / S_N = 0$, then $S_N^{-1}(V_N - \mathbb{E}(V_N))$
converges weakly to a standard Gaussian random variable.
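Proof. [of Proposition [3]]

Consider $Y_{n,i} = (n+1)^{-1/2}\, d_X(a,c_i)$ for $i = 0, \ldots, n$ and

$$\beta_n = (n+1)^{-1} \max_{0\le i\le n} \sum_{j=0}^{n} \big|\mathrm{cov}\big(d_X(a,c_i), d_X(a,c_j)\big)\big|, \qquad S_n^2 = (n+1)^{-2} \sum_{i=0}^{n}\sum_{j=0}^{n} \mathrm{cov}\big(d_X^2(a,c_i), d_X^2(a,c_j)\big).$$

But, by using Formula (26), $\forall (a,a_1,a_2) \in (0,\infty)^3$, $(b_1,b_2) \in \mathbb{R}^2$,

$$\mathrm{cov}\big(d_X(a,b_1), d_X(a,b_2)\big) = a\,\gamma(b_1 - b_2, a, a),$$
$$\mathrm{cov}\big(d_X^2(a_1,b_1), d_X^2(a_2,b_2)\big) = 2\,(a_1 a_2)\,\gamma^2(b_1 - b_2, a_1, a_2),$$

since the variables $d_X(a,b)$ are zero-mean Gaussian r.v. Therefore,

$$\beta_n = a\,(n+1)^{-1} \max_{0\le i\le n} \sum_{j=0}^{n} |\gamma(c_i - c_j, a, a)|, \qquad S_n^2 = 2a^2 (n+1)^{-2} \sum_{i=0}^{n}\sum_{j=0}^{n} \gamma^2(c_i - c_j, a, a).$$

Let $p$ and $q$ be such that $1/p + 1/q = 1$ with $(p,q) \in (1,\infty)^2$. Then the Hölder inequality implies that

$$\beta_n \le C\, a\, n^{-1/p} \times \max_{0\le i\le n} \Big| \sum_{j=0}^{n} |\gamma(c_i - c_j, a, a)|^p \Big|^{1/p}.$$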
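
The covariance of the squares used above is an instance of the classical Gaussian identity $\mathrm{cov}(X^2, Y^2) = 2\,\mathrm{cov}(X,Y)^2$, valid for a zero-mean Gaussian pair. The following Monte Carlo sketch illustrates it for a standardized pair with correlation `rho`; the sample size, the seed and the function name are arbitrary illustrative choices.

```python
import math
import random


def cov_of_squares(rho: float, n: int, rng: random.Random) -> float:
    """Empirical cov(X^2, Y^2) for a standard Gaussian pair with correlation rho."""
    sum_a = sum_b = sum_ab = 0.0
    scale = math.sqrt(1.0 - rho * rho)
    for _ in range(n):
        z1 = rng.gauss(0.0, 1.0)
        z2 = rng.gauss(0.0, 1.0)
        x = z1
        y = rho * z1 + scale * z2  # corr(x, y) = rho by construction
        a, b = x * x, y * y
        sum_a += a
        sum_b += b
        sum_ab += a * b
    return sum_ab / n - (sum_a / n) * (sum_b / n)


if __name__ == "__main__":
    rng = random.Random(12345)
    rho = 0.6
    # Compare the simulated value with the theoretical value 2 * rho ** 2.
    print(cov_of_squares(rho, 200000, rng), 2.0 * rho * rho)
```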
Lemma [2] 1) implies that for every $1 \le i \le n$, for $n$ large enough,

n n 
I p / 

^ 7(ci-Cj,a,a) < C f# {O < j < n, |cj - Cj| < l} + ^ |cj - Cj|~^l 
j=0 j=o 

< C(#{0 < j < n, |i - j| min |cfc+i-Cfc|<l} 

\ 0<fc<n— 1 



Ci-Co >1 



n _^ 

+ y~! ^^'^ |cfc+i-Cfe| 1 I I ) (32) 



j=0 

<2C( min |cfc+i-Cfc|)-Vl + ( min |cfe+i - d)'"^ ^A. (33) 

0<fe<n— 1 \ 0<K<n— 1 — ' / 

£>(maxo<fe<„_i |cfe+i-Cfe|)-l 



Since p > 1, |^| ^ < cxo is finite and thus 



^=1 

1 1 C" 1 

E\i\~^ < -( max |cfc+i-Cfc|)^ < ( min Icfc+i-Cfcl)'' , 

£>(maxo<fe<„_i |cfc+i-Cfc|)-i 

since by definition maxi<fc<„{cfc+i — c^} < C" mini<fc<„{cfc+i — Cfc} < oo. Therefore, 

-i/p 



(in < Cainx min \ck+i-Ck\} , (34) 

with C > not depending on n. Now, a lower bound for is required. For all a > 0, 6 £ IR ^ ^{9, a, a) 
is a continuous map and 7(0, a, a) = /^j [^'(aOl^ > 0- Therefore, for all a > 0, there exists 6*0 > 

such that j{9,a,a) > -7(0,0, a) when |^| < Oa- Then, 

Slia) > C[a^n-^-f\0,a,a)#{0<i,j <n, \ci - Cj\ < Oa} 

> C[a^n"^ 7^(0, a, a) #io < i,j < n, |i-j| max \ck+i - Ck\ < 9a\ 

K 0<fc<n— 1 J 

> C[ a^n"^7^(0,a, a) ^ ( (6*0 max |cfc+i-Cfc|) ^A(n-l)), 

Z \ 0<fc<n— 1 / 
MA(n-l) 

since for /i > 0, #|o < i,j < n, \i — j\ < /i| = 2 n — k > 2([/i] A (n — 1)) — . Thus, since from 

fc=o 

assumptions n max Ic^-^-i — c^l — > 00, there exists Cm > such that for n large enough: 

0<fc<n-l n~*oo 

Sl{a) > Cm (n max \ck+i - Cfcl) \ (35) 

Now, from (IMll and (l35D. 



— <Cn/ max |cfe+i-Cfc|j' ( mm |cfc+i-CfeP 



Sn 0<k<n-l^ ^' ^0<fc<n-l 

^1/2-1/p 

fc+1 — ^k\, 

a/2-l/p 



Therefore $\beta_n / S_n \le C\,\big(n \max_{0\le k\le n-1} |c_{k+1} - c_k|\big)^{1/2 - 1/p}$ with $C > 0$. Next, for any $p \in (1,2)$,

$\lim_{n\to\infty} \big(n \max_{0\le k\le n-1} |c_{k+1} - c_k|\big)^{1/2-1/p} = 0$ and thus $\lim_{n\to\infty} \beta_n / S_n = 0$, and the assumptions of Lemma [3] are fulfilled.
Finally, from ([35]), $S_n^2(a) \ge C_m\,\big(n \max_{0\le k\le n-1} |c_{k+1} - c_k|\big)^{-1}$ with $C_m > 0$ for $n$ large enough. Moreover,
using the bound (33) for $p = 2$ and the lines after (33),

n 

^ 7^(ci - Cj,a,a) < C ( min \ck+i - Ck\) 

n n 

=^ Sl{a) < C' n~'^y2'S^j'^(ci - Cj,a,a) < Cm{n max |cfc+i-Cfc|) . 

i=0 j=0 - - 



Therefore, inequalities (|3T]) are proved. 



Proposition 4 Lei X be a Gaussian process defined by ([2]j or ^ with a spectral density f satisfying 

tp satisfying Assumption W{l,l,r) with r > 1/2. Then if {ck)k is a family of r.v. independent to Tx such 

that Cfc = Co + ^(c„ - Co), with 

^-1 n^f \ n ^^^ri "Co) j / \ n 

n2r Jtj[Cn — Co) — > (J, — > oo and var [Cn — cq) — > U. 

?i— »oo log n n-^oo n—>oo 

Then iTM) /loWs wii/i 



lim (]E{cn- Co)) Sl{a) =4TTa'^ f \ip{az)\^ f'^{z) dz. (36) 
Remark 2 For (cfc)fe satisfying Iil3\) . under Assumption S(2), Proposition^ holds when n}!'^ bn — > 

n— >oo 

because lE\Tn — -2?T„p < "^(5^ maxi<fc<„ iEL^. 

Proof. [of Proposition [4]] The sequence of r.v. $(c_k)_{0\le k\le n}$ is independent of $\mathcal{F}_X$. Therefore, $(d_X(a,c_k))_{0\le k\le n}$
has the same distribution as $(d_X(a,c_k - c_0))_{0\le k\le n}$. Indeed, for a sequence of deterministic real numbers
$(c_k)_{0\le k\le n}$, $(d_X(a,c_k))_{0\le k\le n}$ is a stationary sequence and therefore $(d_X(a,c_k))_{0\le k\le n}$ has the same distribution
as $(d_X(a,c_k - c_0))_{0\le k\le n}$. Next, conditionally on the $\sigma$-algebra $\sigma((c_k)_{0\le k\le n})$, $(d_X(a,c_k))_{0\le k\le n}$ has
the same distribution as $(d_X(a,c_k - c_0))_{0\le k\le n}$. Finally, since $\sigma((c_k)_{0\le k\le n})$ and $\mathcal{F}_X$ are independent,
$(d_X(a,c_k))_{0\le k\le n}$ has the same distribution as $(d_X(a,c_k - c_0))_{0\le k\le n}$.
Now, we can only consider the case: $c_k = k\tau_n/n$ with $\tau_n := c_n - c_0$. Define

1 " 

^-(«) •= -Y.'^x{a,lEck). 

k=0 

It is clear that {]Eck)i<k<n is a deterministic sequence. Thus 

Now, one has to check that the error $I'_n(a) - I_n(a)$ is negligible with respect to $S_n(a)$ in $L^2(\Omega)$ norm.
But

$$S_n(a) \ge C_m \cdot \big(n \max_{k} |\mathbb{E}(c_{k+1} - c_k)|\big)^{-1/2} \ge C \times (\mathbb{E}\tau_n)^{-1/2}.$$

Therefore, it suffices to prove that $\lim_{n\to\infty} \mathbb{E}\tau_n \times \mathbb{E}\big[(I'_n(a) - I_n(a))^2\big] = 0$. Since the r.v. $c_k$ are independent
of $\mathcal{F}_X$, one gets

lE[{l'^{a) - /„(a))2] = lE[]E[{l'^ia) - In{a)f \ J'x] 



1 



(n + l)2 
2a 



]E\]E[{d\{a,]Eck) - d\{a,Ck)){d\{a,]Eck>) - d\{a,Ck>))\J'x] 



k,k'=0 
2 " 



^ 7^ [lEck - lEck' ,a,a) - 7^ {Eck - Ck' , a, a) 



k,k'=0 



-7^ (cfc - ]Eck' ,a,a) + 7^ (cfc - Ck' , a, a) 
Next, from Taylor expansions, 

-y'^ {Eck - Ck' , a, a) = ^"^ {lEck - lEck' ,a,a) + 2[lEck' - Ck') x ■ ■ ■ 

/ ji^lEck — lEck' + A {Eck> — Ck>),a, a)j' (^lEck — JEc^i + A {lEck' — Ck'),a, aJdX 
Jo 



[ck - lEck' , a, a) = ^"^{ck - cy ,a,a) +2{ck' - lEck') 



X • • • 



l[ck - Ck' + X {lEck' - Ck'),a,a) 7(0^ - c^' + A {Eck' - Ck'),a,a) d\. 
Prom LemmaEl 3C > such that 17(6*, a, a)j'{9, a,a)\ < C x (l A 9~'^) for ah 6 £ M. One can deduce that 
7^ {Eck - Eck' , a, a) - 7^ [Eck - Ck' ,a,a) - 7^ (c^ - Eck' ,a,a) + 7^ (c^ - c^' , a, o) 

< c|c,.-iEcfe,| X [\i/\e-l,,{x)) + {iAe-l,,{x))dx 



with 9i^k,k'W = JE{ck - Ck') + X{Eck' - cy) and 62,k,k'W = Ck - cy + X{Eck' - cy). 
Then, 

E[{l'^{a) - In{a)f] < 2a^ x (^n + C:r2) , 



(38) 



1 

where, for i = 1,2, Stj := I E 

lo 

ETu 

n 
C 



-^-2 ^ \ck' - Eck'\ X (1 A 0-fe%,(A))] dX. Thus 0i,fc,fc'(A) 



- AA;'z„) with 5^ := 



and : 



k,k'=o 

Tn - Et, 
ETn 



■. Then, using 5'^ — > 0, for n large enough. 



E 



"L(n+1)2 
1 



< / E 
lo 



\ik'6'JZn\ X (lA[6'^{{k-k')-Xk'Zn)Y 

k,k'=0 



dX 



{JETnY Jo 



dxdy\yZn\x\\A\(x-y)-XyZr\ \ 



dX. 



But, for all A G (0, 1), one has 



{ETnf 



JO 



\y\ X [1 A [{x - y) - Xy Zn] )dxdy 



1 /.I 



Et„ 




X {I A {ETny"^ [{u - v) - Xv Zn\ '^ )dudv 

/'Z f'QO 
I (1 A {ETny^s'^) ds dt < 4. 



'0 JO 

1-2 roo 
Therefore $\mathcal{E}_1 \le 4\,\mathbb{E}|Z_n|$. Now, using the same method for $\mathcal{E}_2$, one obtains

$$\mathbb{E}\tau_n \times \mathbb{E}\big[(I'_n(a) - I_n(a))^2\big] \le C\,\mathbb{E}\tau_n \times \mathbb{E}|Z_n| \le C\,\big(\mathrm{var}(c_n - c_0)\big)^{1/2} \xrightarrow[n\to\infty]{} 0$$

from assumptions. This induces that CLT (30) holds.



Now the asymptotic expansion (|36|) can be proved. Consider first the deterministic case and since 

l_e»("+i)°2 sin2((n + l)a/2) 



fc,fe'=0 



{k-k')a 



1 - e*' 



sin2(a/2) 



then. 



2a2 



(n + l)2 

(n + l)2 
16a2 



fe,A:'=0 



iR2 



JR2 



.,^|2 ,..,sSm 



2 Cn-CQ n+1 _ ^/^"^ 
) / r- r-n / >. 

sm' 



2 n 



In 



Cn - Co J ml 



|V^(a/)|V(^') i>(a{z' + 



2z 



Cn - Co 



f z' + 



2z \ sin2 fi^z 



Cn-coJ n^sin^ (^) 



?T-ldzdz'. 



sin (^^iix) sinx 
Let us define /in(x) := . and h{x) := . For aU {z,z ) G M^, 



'tjj(a{z' + 



nsm 



2z 



X 



f[z + 



— > |^(a2;')| and — > h^{z). 

Cn Cq / V Cfi Co / n— ^oo n— >oo 

However Lebesgue Theorem cannot be apphed. Denote v{x) := \Tp(ax)\'^ f{x) for x > 0. From Assumptions 
F and W(l,r) with r > 1/2, is a differentiable function in (0, 00) and 3C > 0, Vz',x > 0, {i^'iz' + x)| < 
C\u'{z')\. Then, 



2z 



v[z + 

Cn - Co 

Moreover, \hn{^z')\ < 1 for all G and 



) - K^') 



< 



2z 



Cn - Co 



hl{z) dz 



-I fn 



2 ^ sin (2 {k - k')) J_ " 



n — ' ik — k') 



A:=0 



Therefore, 



hl{z)dz = 2\ Y,^-^ 



+ 2 = 7r. 



(39) 
2 ; "

Since - }2i<k<n k- 



{2k) 



< 



and from Dirichlet Theorem, x — tt = — 2^„>]^ sia{nx) ^ 



X G (0, 27r). Now, for z' > and n large enough, 
2z 



v[z' + 

iR I C„ - Co 



)h^{z)dz — v{z') / h\{z)dz 



^ 2C\v{z')\ r . , , ^ r / , 2z .2 



< 



< 



< 



Cn — Co Jo 

2C\v{z')\ r 4sin2(z) 



Cn - Co ' 



h^[z)dz 



Cn ~ Co JO 

8C|t;(zO| 

Cn - Co 

8CKz')l 



dz + viz + 



2z 



Cn - Co 



) dz 



Cn ~ Co \ JQ 

under Assumption W(l, But when r > 1/2, 

c„ - Co 



/ zdz+ I -dz^+Cf{z') ^(a{z' + 
/ zdz+ -dz) +Ca'^'f{z') — 

Jo Jl Z J Jn (1 



2z 



Cn - Co 



dz 



dz 



1 



(1+ z' + 



2z ^^2r 



Cn-CQ 



dz < 



2n/{cn-co) 



r2r 



2{2r - 1) ' 



and therefore there exists C > such that for all z' > and for n large enough, 

2z 



v(z' + 

JR+ Cn- Co 

< C 



)h'^{z)dz — v{z') / hf^{z)dz 



1 + log(n) 



Cn - Co V n^"27 ^ 



2r 







under assumptions of Proposition [H Finally, with (|39p in mind, one deduces that for all z' > 0, 

2z 



lim / v{z' + 
'1R+ 



■) h'i{z) dz = lim i;(z') / h^{z) dz = — v{z'). 

Cn - Co' n^oo 2 



Therefore, using the same method in M- as in iR+, one obtains 



ip{az')ff{z') f i,(a{z' + ^^ 



f[z + 



sin^(z) 
Cn - co^ n2 sin^ (f ) 



dz dz' 



TT / Waz')\''p{z')dz', 



providing the asymptotic behavior of 5^. The proof is similar in the stochastic case with Cn — cq replaced 
by iE'(cn - Co). ■ 

Proof of Theorem [1] 

The proof of Theorem [1] uses the following lemmas:
Lemma 4 Let $X$ be a Gaussian process defined by ([1]) with a spectral density $f$ satisfying ([3]) and Assumption
F(H). Let us define,

$$R(t,u,t',u') := \mathbb{E}\big[(X(t+u) - X(t))\cdot(X(t'+u') - X(t')) \,\big|\, \mathcal{F}_X\big]$$

for $(t,t') \in \mathbb{R}^2$, $(u,u') \in \mathbb{R}_+^2$. Then $\exists C_f > 0$ depending only on the spectral density $f$ such that for all
$(u,u',t,t') \in \mathbb{R}_+^2 \times \mathbb{R}^2$, with $\beta = (t' - t + u' - u)$,

$$|R(t,2u,t',2u')| \le C_f\,\Big(u\,u' + (u\,u')^{(1+H)/2}\,|\beta|^{H-1}\,1_{0<H<1}\Big).$$



Proof. To begin with, remark that for all $(t,t') \in \mathbb{R}^2$, $(u,u') \in \mathbb{R}_+^2$,

$$R(t,2u,t',2u') = 8\int_0^\infty \sin(u\xi)\cdot\sin(u'\xi)\cdot\cos\big(\xi(t'-t+u'-u)\big)\,f(\xi)\,d\xi = 8\,(I_1 + I_2)$$

with $I_1 := \int_0^1 \cdots\, d\xi$ and $I_2 := \int_1^\infty \cdots\, d\xi$. On the one hand, with $|\sin a| \le |a|$ and $|\cos a| \le 1$,

$$|I_1| \le u\,u' \int_0^1 \xi^2 f(\xi)\,d\xi \le C\,u\,u',$$

where the last inequality follows from ([3]). Now, if $H > 1$, then the same bound can be extended to $I_2$ since
$f(\xi) \le C\,\xi^{-2H-1}$ and $\int_1^\infty \xi^{1-2H}\,d\xi < \infty$. Therefore if $H > 1$,

$$|R(t,2u,t',2u')| \le C_f\,u\,u'$$

with $C_f$ only depending on $f$.
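
The trigonometric form of the integrand can be checked pointwise. Assuming the harmonizable representation of the increments, the integrand of $R(t,2u,t',2u')$ at frequency $\xi$ is $(e^{i(t+2u)\xi} - e^{it\xi})\,\overline{(e^{i(t'+2u')\xi} - e^{it'\xi})}\,f(\xi)$, whose real part equals $4\sin(u\xi)\sin(u'\xi)\cos(\beta\xi)f(\xi)$ with $\beta = t'-t+u'-u$. Since $f$ is even, integrating over $\mathbb{R}$ doubles the half-line integral, which gives the factor 8. The sketch below verifies the identity numerically; the function names are illustrative.

```python
import cmath
import math


def increment_product(t, u, tp, up, xi):
    """(e^{i(t+2u)xi} - e^{it xi}) times the conjugate of the primed factor."""
    first = cmath.exp(1j * (t + 2 * u) * xi) - cmath.exp(1j * t * xi)
    second = cmath.exp(1j * (tp + 2 * up) * xi) - cmath.exp(1j * tp * xi)
    return first * second.conjugate()


def trig_form(t, u, tp, up, xi):
    """4 sin(u xi) sin(u' xi) cos(beta xi) with beta = t' - t + u' - u."""
    beta = tp - t + up - u
    return 4.0 * math.sin(u * xi) * math.sin(up * xi) * math.cos(beta * xi)


if __name__ == "__main__":
    for args in [(0.3, 0.1, 1.2, 0.25, 2.0), (5.0, 0.7, -1.0, 0.05, 0.4)]:
        print(increment_product(*args).real, trig_form(*args))
```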

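If $0 < H < 1$, firstly one obtains with a change of variable, $|\cos a| \le 1$ and $|\sin a| \le (1 \wedge |a|)$,

$$|I_2| \le (uu')^{-1/2}\int_{\sqrt{uu'}}^{\infty} \big|\sin(u\xi/\sqrt{uu'})\,\sin(u'\xi/\sqrt{uu'})\big|\, f(\xi/\sqrt{uu'})\,d\xi$$

$$\le C\,(uu')^{H} \int_0^\infty \big|\sin(u\xi/\sqrt{uu'})\,\sin(u'\xi/\sqrt{uu'})\big|\,\xi^{-2H-1}\,d\xi \le C\,(uu')^{H}\Big(\int_0^1 \xi^{1-2H}\,d\xi + \int_1^\infty \xi^{-2H-1}\,d\xi\Big) \le C_H\,(uu')^{H}, \tag{40}$$

with $C_H > 0$ depending only on $H$. Secondly, with $\beta = (t' - t + u' - u)$ and an integration by parts,

$$\int_1^\infty \sin(u\xi)\sin(u'\xi)\cos(\beta\xi)\,f(\xi)\,d\xi = \frac{1}{\beta}\Big(\Big[\sin(u\xi)\sin(u'\xi)\sin(\beta\xi)\,f(\xi)\Big]_1^\infty - \int_1^\infty \sin(\beta\xi)\,\frac{\partial}{\partial\xi}\big(\sin(u\xi)\sin(u'\xi)\,f(\xi)\big)\,d\xi\Big),$$

where Assumption F(H) ensures the convergence of the bracket term at $\infty$. Using again Assumption F(H) for
$f'$, changes of variables, $|\cos a| \le 1$ and $|\sin a| \le (1 \wedge |a|)$,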

I2 < CfUu' + CoP^^ / I sin (/3 I ( u I sin (ti' x) I + n'l sin (u I j x~^^~^(i2; 

(/? x) sin x) sin (n' x) 



sm I 



-2H-2 



dx 



< CfUu'-\-CoP ^{uu')^ J I sin{f3 x/Vuu') \ (^u | sin (^u x/Vuu')\ + u'\ sin (ux/V uu'^\j 

I sin (/?x/\/ uu') sin x/Vuu') sin [ux/Vuu')\j x' 



„-2H-l 



dx 



-1 



2H-2 



dx 

< CfUu' + Co{2{uu'f I x-^"+^dx + p~\u + u'){uu')^ I x-^^-^dx) 

Jo Jl 

+ Cl,{{uu'f / X-^'^+'dx + p-Huu'f^'/^ / X-2^-2^x) 

< Cf{uu' + {uu'f + p-\u + -uOCnn')^ + p-Huu'f+^/^), 
where Cj > only depends on /. Therefore with (j40p . for < < 1, 

I/2I < A (^-H'" + ^^')(^^^^')'^+^"^'"^')■^^^^^)■ 

But the inequality $(x \wedge y) \le x^{\alpha} y^{1-\alpha}$ is valid for all $x, y \ge 0$ and $0 \le \alpha \le 1$. Applied to the previous
inequality with appropriate choices of $\alpha$, it yields $|I_2| \le C\,(uu')^{(1+H)/2}\,|\beta|^{H-1}$. This completes the proof
of Lemma [4]. ■
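
The interpolation inequality $(x \wedge y) \le x^{\alpha} y^{1-\alpha}$ used at the end of the proof holds because a weighted geometric mean of two positive numbers lies between them. A quick numerical check on a small grid is sketched below; the grid values and the name `interpolation_gap` are arbitrary illustrative choices.

```python
def interpolation_gap(x: float, y: float, alpha: float) -> float:
    """x**alpha * y**(1 - alpha) - min(x, y), for x, y > 0 and 0 <= alpha <= 1."""
    return x ** alpha * y ** (1.0 - alpha) - min(x, y)


values = [0.01, 0.1, 0.5, 1.0, 2.0, 10.0, 100.0]
alphas = [k / 10.0 for k in range(11)]

# Smallest gap over the grid; it should be non-negative up to rounding.
smallest_gap = min(
    interpolation_gap(x, y, a) for x in values for y in values for a in alphas
)

if __name__ == "__main__":
    print(smallest_gap)
```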

Let $\varepsilon_n(a,b)$ be the error between the wavelet coefficient and its approximation, i.e.

$$\varepsilon_n(a,b) := d_X(a,b) - e_X(a,b).$$

The error $\varepsilon_n(a,b)$ contains three different terms. The first one corresponds to the replacement of the integral
over the interval $[0, T_n]$ by its Riemann sum. The second and the third ones correspond to the replacement
of the integral over $\mathbb{R}$ by the integral over the interval $[0, T_n]$. More precisely, one has

$$\varepsilon_n(a,b) := \varepsilon_{1,n}(a,b) + \varepsilon_{2,n}(a,b) + \varepsilon_{3,n}(a,b), \tag{41}$$

with



ei,n(a, 6) 


:= a-'/' 


£2,n{a,b) 


:= a-V2 


e3,n(a, 


:= a"V2 



/•T 4. i n— 1 „j( 



) V a 



V a 

n 



X{t) dt, 
X{t) dt. 



The following lemma give bounds on ]E\ei^n{o,, k)\ for i = 1,2, 3. 
Lemma 5 Let $X$ be a Gaussian process defined by (OP with a spectral density $f$ satisfying and Assumption F(H).
Assume also Assumptions M^(l,3,0) and let $(p_1, p_2) \in [1, \infty)^2$ and $(q_1, q_2)$ be defined by
$\frac{1}{p_i} + \frac{1}{q_i} = 1$ for $i = 1, 2$. Then, there exist positive constants $C_1$, $C_2$ and $C_3$ depending only on $f$, $a_{\min}$,
$a_{\max}$, and $p_1$, $p_2$, such that for any r.v. $b$ independent of $\mathcal{F}_X$ and satisfying $0 < b < T_n - T_n^{\rho}$ with
$\rho > 1/2$, we have



(42) 



1=0 



n-l 



i=0 



(43) 

2-£+2/p2 



-25^(|£j,n(a) ^) 1^ I ^x) 1^ C2 a^T^ for n large enough and i = 2,3. 

Remark 3 When H = 1, the term di'^^^^^^^ = S^r^"^'^^ should be replaced 6?/ In (5„ x 5^+^^^" = O [5: 
for all £ > 0. Thus, if H = 1, it suffices to replace H by 1^ and formula remains valid. This convention 
will be adopted in the following, in order to lighten the notations. 

Proof. (1) Bound of $\mathbb{E}(|\varepsilon_{1,n}(a,b)|^2 \mid \mathcal{F}_X)$. To begin with,



iE(|ei,„(a,6)|' I Tx) 



1 An) An) 



i+1 fS+i /t — b\ , / t 



(n) l^(n) 

i 



i,j=0 ■'^ 



n-l n-l 



a 



lEi{X(l^)-X{tf~^)){X{^)-X{tf))\Tx)dtdt! 



i=0 j=o " i-i 



+1 f j+1 , ft — b-^ 
a 



(n) l_f{n) 



^(^)^(^)i?(tS"\t - t^,t^,^ - t!f )dtd£. 



Lemma H with 2u = t- , 2u' = t' - tf and /3 = tf - if ^ + i (t' - tf )-^{t- tf^ ) = \{t' + if ^ -t-t 
impHes 

IE{\ei^n{a,b)\^ \ Tx) < C/a-^ (5i + ^2 W<i) 



with 



Set 



Si 
S2 



1 



ij=0 



(n) /^(n) 

i 

1 An) An) 



UU 



t-b 



t'-b 



dtdt' 



E 

jj=0 



(n) /^(n) 

3 



t-b 



t'-b 



dtdt'. 



n-l 



i=0 



(44) 



We have Si = J^" xit) 
we have 



t-b 



dt ) , then from Holder Inequality, for {p, q) G [1, 00]^ with 1/p+l/q = 1, 
t-b^ 



Si< 



2 

ra • 
Obviously



t-b 



On the other hand, for $p < \infty$,

n—l 



X 



HI 



(/^x'"W*)'" = (/^Ei„<".,.<;.,(*)l«-4"'l'"*) 

n-1 

(i:/,:ri-4"r'*) 

1=0 



i=0 

With the convention $1/\infty = 0$, this result remains in force for $p = \infty$. It follows that for all $1 \le p_1 \le \infty$,

|2 n—l „, 

, 2/pi 



51 < 



(pi + 



(45) 



i=0 



Next, in order to bound $S_2$ for $0 < H < 1$, write



0<i<7<n— 1 i J 



0<j<j<n 



where u = \{t - tf^), v! = \{t! - if ^) and /3 = - n. But since ^ = \ {{t' + tf'') - {t + ti)) for i < j and 



t e t' e [^r'^J+i] l/^l ^ h\t'-t\- Therefore 



(n) (n) 



0<i<J<n-l'^*< 



< 



t-b\ n'-h 



'-)\t-t'\"-^ 



for any (p, G [1, cxd]^ with l/p + l/q = l. But for all p > 2 



CP {IB?) 



i=0 



Next, with $u = (t - c_k)/a$ and $v = (t' - c_k)/a$, one gets



9 ^ Q2+g(l-i?) /" IV^NV-WI' 



dudv 



<a^+^(-^)( / |^(.)^(.)rd.d. + / ^^^^^^^ 



dud?; , 



2g , 

-I- 



The last integral is equal to $(1 - q(1-H))^{-1}$ when $p > 1/H$. Thus, for all $p_2 > 1/H$,



So < 



12 



nH-3+2/p2(-i . r,„n -u 



l/'?2 



(46) 
Finally, by summing up (45) and (46), one gets the bound of $\mathbb{E}(|\varepsilon_{1,n}(a,b)|^2 \mid \mathcal{F}_X)$.
(2) Bound of $\mathbb{E}(|\varepsilon_{2,n}(a,b)|^2 \mid \mathcal{F}_X)$. Since $T_n$ is independent of $\mathcal{F}_X$,



iZ?(|e2,n(a,&)|' I ^x) = - / / 4 )M- -]]E{X{t)X{t'))dtdt' 

Qj frn I rp \ CL / \ Qj / 

't-h\ - - n2 



< -I 



'l + \t\)dt 



from LemmalU But, according to Assumption W(l,3), (1 + is a bounded function and 

' {l + t){l+{t-b)/a)-'dt) . 

If > 1, then 1 -\- {t — h)/a < 1 + (t — + Tn)/a for all t > and with the change of variable 

V={t- Tn + TP)/Tn, 

/ < Tntt-^ o -dv 

Jt„ (1 + (t - b)/aY Jt^' [a+v TnY 



If Tn < 1, by using b < r„, 



T„ (1+ (t-6)/a)'' Jo (l + v/a) 



dt < I ^ (if; < a + - . 



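Finally, one deduces the bound of $\mathbb{E}(|\varepsilon_{2,n}(a,b)|^2 \mid \mathcal{F}_X)$.

(3) Bound of $\mathbb{E}(|\varepsilon_{3,n}(a,b)|^2 \mid \mathcal{F}_X)$. Finding a bound for $\mathbb{E}(|\varepsilon_{3,n}(a,b)|^2 \mid \mathcal{F}_X)$ follows the same steps
as for bounding $\mathbb{E}(|\varepsilon_{2,n}(a,b)|^2 \mid \mathcal{F}_X)$. ■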



Lemma 6 Under assumptions of Lemma and if s > 2 + [l — 3i?] ^ and if 



< d <1, 



5^ = n""* with V (— 

\l + s{2HAl)^ ^s{2 + {H Al))-lJ 

then for all a > 0, 

{n5„)lE{\enia,Ck)f) — >0 forallO<k<n. (47) 

n^oo 

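Proof. With $(x + y + z)^2 \le 3\,(x^2 + y^2 + z^2)$ for all real numbers $x, y, z$,

$$\mathbb{E}(|\varepsilon_n(a,c_k)|^2) \le 3\,\mathbb{E}\big[\mathbb{E}(|\varepsilon_{1,n}(a,b)|^2 \mid \mathcal{F}_X)\big] + 6\,\mathbb{E}\big[\mathbb{E}(|\varepsilon_{2,n}(a,b)|^2 \mid \mathcal{F}_X)\big].$$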
Then, using Lemma 5 with $p_1 > 1$ and $p_2 > 1/H$, an optimal choice of $p_1, p_2$ depends on $s$. Hence, since
$\mathbb{E}(|Z|^{\alpha}) \le (\mathbb{E}|Z|)^{\alpha}$ for any r.v. $Z$ and $\alpha \in [0, 1]$ from Jensen's inequality,

1. if 3 < s, with 1 + pi = s, 

j=0 

2. if max(2 + F, I + < s, with 1 + ^^2(1 + -H") = s, 



i=0 



However, these inequahties may be extended to smaller values of s by using the sharper inequality iE( 

< n°(^^niax when (a,/?) G (0,1]^ and therefore for r > 1, iE(^L[)"^ < 

max ]E{\Li\^^))"; it is then possible to choose r/3 = s. Thus, 

Cj<n— 1 



n"( max 

■ 0<j<n- 

1'. if 2 < s < 3, with aj3 = ^ and s = P + ^, the best possible choice is a = 1 and /? = s — 2, 

n— 1 2 « 

2'. if > 1/2, and l + i?<s<2 + i?, with a/3 = ^ and s = /? + the best possible choice is 

a = 1 and /3 = s - (i7 + 1), 

n—l 



i=0 



-'ri' 



2". if < i7 < 1/2 and ^ + 2F<'^-i + 2F' ^^^'^ = ^ ^"^"^ s = /? + the best possible choice 

a = 2H and P = s- 



j=0 

We finally obtain for n large enough and using n5n — > 00 and n5f^ — > 0, 

n— »oo n— >oo 

. [iH>l/2,]E[]E{\ei,n{a,b)\^ \ J^x)] < C {n5'^ 

^2<s<2+{HAl) {HAl)<s)i 

.iiO<H< 1/2, iE[iE(|ei,„(a, 6)1' | J^x)] < C l2v(i+^)<.<|+^ + H+2^<^)' 
Both these inequalities may be reduced to only one for ah s > 2 V (i + 277) and H > Q: 



IE[]E{\ei^n{a,h)\^ \ J'x)] < 



Hence, with 5n = C^n 



2 ' 2H ' 

C (nJ^)'''^' if 2 V (i + 2^7) < s < (2 + A 1) V (I + 2^7) 

l+(gAl) 



• l+s{2HM) 



. < d < 1 if f2 + fifAl)Wfg + 4.) <s<oo 
then



b)f I ^x)] 0. 

n^oo 

To finish the proof of Lemma 6, it remains to show that $(n\delta_n)\,\mathbb{E}\big[\mathbb{E}(|\varepsilon_{2,n}(a,b)|^2 \mid \mathcal{F}_X)\big] \to 0$ as $n \to \infty$.

/•oo 

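From Lemma 5 it follows that $\mathbb{E}\big[\mathbb{E}(|\varepsilon_{2,n}(a,b)|^2 \mid \mathcal{F}_X)\big] \le C \int g(x)\, f_n(x)\,dx$, where $f_n$ is the probability
distribution function of $T_n$ and $g(x) = \mathbf{1}_{(x \le 1)} + \mathbf{1}_{(x > 1)}\, x^{2-4\rho}$. Since $\rho > 3/4$, $g(x) \le 1$ for all $x > 0$ and $g$ is
a non-increasing map,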



g{x)fn{x)dx < 



1 



fn{x) dx + g{-msn5n) 



\msn5n 



fn{x) dx 



2-4p 



< P{Tn <^msn6n) + g{^msn6n) 

< P{\Tn-lE[Tn]\ > IE[Tn] -^msn6n) + i^msnSn) 

from Bienayme-Chebyshev Inequality since s > 2 and var (T^) < M^n^^ from the independence of (Lj)jgjv- 
Therefore $(n\delta_n)\,\mathbb{E}\big[\mathbb{E}(|\varepsilon_{2,n}(a,b)|^2 \mid \mathcal{F}_X)\big] \to 0$ as $n \to \infty$. ■
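
The Bienaymé-Chebyshev step above can be illustrated by a small simulation. The Python sketch below is written under assumptions that are not taken from the paper: the durations are i.i.d. Exp(1) variables scaled by a hypothetical step `delta`, so that $\mathbb{E}[T_n] = n\,\delta$ and $\mathrm{var}(T_n) = n\,\delta^2$, and the deviation threshold is set to half of $\mathbb{E}[T_n]$.

```python
import numpy as np

def chebyshev_check(n, delta, n_rep, rng):
    """Compare the empirical deviation probability of T_n with its Chebyshev bound."""
    # Hypothetical design: T_n = delta * (sum of n i.i.d. Exp(1) durations).
    durations = delta * rng.exponential(1.0, size=(n_rep, n))
    T = durations.sum(axis=1)
    mean_T = n * delta               # E[T_n] for Exp(1) durations
    var_T = n * delta**2             # var(T_n), by independence of the durations
    threshold = 0.5 * mean_T         # deviation level (illustrative choice)
    empirical = float(np.mean(np.abs(T - mean_T) > threshold))
    bound = var_T / threshold**2     # Bienayme-Chebyshev upper bound
    return empirical, bound

rng = np.random.default_rng(2)
n = 100
empirical, bound = chebyshev_check(n=n, delta=0.1, n_rep=20_000, rng=rng)
```

In this setting the bound equals $4/n$ whatever the value of `delta`, so the probability that $T_n$ falls below half of its mean vanishes as $n$ grows, which is the mechanism used in the proof.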

2HA1 l+(-H"Ai) 2-4/3 

Proof. [Theorem [1] Denote Un(a) = {^6^) + (n(5^) ''"^ + [n6n) ■ Then, following the same 
method as in Bardet and Bertrand (2007b), pp. 33-35, one obtains

$$\mathbb{E}\,|I_n(a) - J_n(a)| \le C\, v_n(a)^{1/2},$$

and from this, Lemma 6 and Slutsky's Lemma, the proof is achieved.
Proof of Proposition 2
Proof. It is obvious that 

i)= f I^A(e)|V(e/«K = A I 



(48) 



Ma)= /_|^A(e)r/(e/«)dc = A /_|v(A(e-i))| /(w^^c 

|2 „/l V 



Then, from a usual Taylor expansion, and since tp is supposed to be an even function supported in [—A, A], 



For A > 2A, then sup 

-A/X<h 



1\{a) - ||^||£2(jR) 

1 + h 



< 



1 



2a2A2 



sup 

~A/X<h 



a ' 



V I ijj (w) I dv 



f"{ 



\ < sup {|/"(2;)|} < 00. Therefore, since satisfies Assumption 

^ x>l/2a 



W(l, 5), there exists C > such that, 



,,„,/(l/a)| <c4 



(49) 
Let us denote $I_n^{(\lambda)}(a)$ (respectively $\mathcal{I}_\lambda(a)$, $\beta_n^{(\lambda)}$ and $S_n^{(\lambda)}(a)$) instead of $I_n(a)$ (resp. $\mathcal{I}_1(a)$, $\beta_n$ and $S_n(a)$)
when $\psi$ is replaced by $\psi_\lambda$. Firstly,

Uattu^ [ \Maz)\^f{z)dz) = Ana [ fij{u)\^ f (- + ^) du 



— > A'Kaf^{l/a) I \i){u)\ du, 

X-*oo J ]R 

from Lebesgue Theorem. Hence, if (A„) is a sequence such that — > oo, 

n— »oo 

^^^"^ (5^)(a))^ 4vra/2(l/a) / |^(n)|^dn. (50) 



Secondly, from the proof of Proposition [3] and inequalities ([Mjl and (j35]) . there exists C > not depending 
on n and A, 



Pi^^/Si^^ < CI^\a) (n max \ck+i - CklV^^ ^''^ for all q £ (1,2). 

l<fc<n 



Thus, since I\{a) is bounded, Pn " /Sn " — > and Proposition [3] becomes: 

n—*oo 



AA(0,1) 



Finally, using (49) and (50), one deduces that for all $a > 0$,

\[^{l^r^-\a)-\\^l, f{l/a)) ^ M(0A7raf\l/a) [ \^iu)\'du)., 



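when $(\lambda_n)_n$ is such that $\sqrt{n\,\delta_n/\lambda_n}\;\lambda_n^{-2} \to 0$, i.e. when $\lambda_n^{-5}\, n\,\delta_n \to 0$ as $n \to \infty$. Since also $\lambda_n^{-1}\, n\,\delta_n \to \infty$ (to
obtain a consistent estimator), then with $\delta_n = n^{-d}$ and $\lambda_n = n^{d'}$,

$$\frac{1-d}{5} < d' < 1 - d. \qquad (51)$$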
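
As a sanity check on condition (51), the following Python sketch computes the range of exponents $d'$ associated with a given $d$ and evaluates the two rates involved. It assumes that (51) reads $(1-d)/5 < d' < 1-d$ and that it encodes the two requirements $n\,\delta_n/\lambda_n \to \infty$ and $n\,\delta_n/\lambda_n^5 \to 0$ with $\delta_n = n^{-d}$ and $\lambda_n = n^{d'}$; the function names are hypothetical.

```python
def admissible_d_prime(d):
    """Open interval ((1 - d)/5, 1 - d) of exponents d' for a given 0 < d < 1."""
    if not 0.0 < d < 1.0:
        raise ValueError("d must lie strictly between 0 and 1")
    return (1.0 - d) / 5.0, 1.0 - d

def rates(n, d, d_prime):
    """Return (n*delta_n/lambda_n, n*delta_n/lambda_n**5) for power-law choices."""
    delta_n = n ** (-d)
    lambda_n = n ** d_prime
    return n * delta_n / lambda_n, n * delta_n / lambda_n**5

lo, hi = admissible_d_prime(0.5)        # interval (0.1, 0.5)
small = rates(1e3, d=0.5, d_prime=0.3)  # d' = 0.3 lies inside the interval
large = rates(1e6, d=0.5, d_prime=0.3)
```

For $d = 0.5$ and $d' = 0.3$ the first rate behaves like $n^{0.2}$ and the second like $n^{-1}$, so the first grows and the second decays as $n$ increases.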

Moreover, Proposition [5] has also to be checked. In its proof, lETn has to be replaced by lETn/\n and since 
the bounds C (1 A |^|~^) in Lemma[5]have to be replaced by C/A^ (1 A |^|~^), then condition n(5^ — > 



n— >oo 



has to be replaced by nJ^/A^ — > 0, that is d' > — - — which is satisfied when ([5T]) is satisfied. 

n— »oo 5 

It remains to control e^(a,Cfc) with Lemma [5] and [H For all 1 < < cxd, with l/oo = by convention, 

IIVaL, = A^^-^V^-? and = A^^-^^/^, ||^||^^_ 

Then, Lemma [5] b Gcomes (with. — > oo): 

n— *oo 

r .Ci||V||^iA„5^(^^i) ifs = oo; 

""1 n/ 



lE{\ei,nia,b)\' \J'x)<< 
The case $s < \infty$ can be detailed further according to the values of $p_1$ and $p_2$ considered in Lemma 6.



a/ i{H> 1/2 and 

• 2 < s <2 + {H Al), then |^ = s - 2 and ^ = (s - (if A 1) - l)/2 and therefore 
]E{\ei,n{a,b)f I ^x) <C(A3- + A2+(^^i)-^)(n5^) 

b)f I ^x) <C'Xn (nJ^); 

• 2 + {H Al) < s, then pi = s - 1 or |- = s - 2 and ^ = (1 + (iJ A l))/(2(s + 1)) and therefore 



s-(2+(gAl)) ^ 1 + (HA1) 

s — 1 



n 



6/ if < i7 < 1/2 and 

• 2 V (i + 27r) < ^ < i + 27r. then A = s - 2 or |- = ^ and ^ = F(s - (i + 2^)) and therefore 

^ ]E{\ei,n{a,b)\^ I ^x) < C"A„(n5^)'^; 



I + Tjir < s, then A = s- 2or — = ^ 



J and ^ = (1 + H)/{2{s + 1)) and therefore 



lE{\ei^n{a,b)\' I ^x) <C(A3-^(n5^)+Ar^(n5:)-i +A„ 



l-2/f(s-(i + ^)) 



in 



l+H 



]E[\ei^n{aM\ I ^x) < C'Xn{n6'^) . 



Note that the bound is not always optimal but it simplifies a lot of different subcases. Condition (j47p is 
now ——^ lE[\ei^n{a,b)\ \ Tx) — > 0. Therefore in any case this condition does not depend on A„ and it 

can be summarize with (5„ = n~'^ with (the case s = co is obtained by replacing with the limit): 
. if2v(i+2^)<-<(2 + (^Al))v(i + ^), d>i^^§^y 



• if (2 + A 1)) V (I + < s < 00, 
Finally, for b <Tn — Tn, with ^ satisfying Assumption W(l, 3, 1): 



d> 



s + {HAl) 
s{2 + {H Al))-1' 



Irp Irp \ Qi 

' J-n ^ J-n 




t'-b 



]E{X{t)X{t'))dtdt' 



< CfiaXn) 



-1 



< 



Cf (a^A^ ) 



t-b 

aXn 
(n) I u du 

00 



l + \t\) dt 

2 



2 X 



Therefore the CLT holds when A^ {n5nf~ 



0, i.e. --^ — > 00 since p > 3/4. 



(52) 
(53) 
                                        H = 0.2        H = 0.5        H = 0.8
T1    √MSE of $\widehat{f}_N(1)$        0.47           0.65           0.77
      MISE on [0.3, 5]                  2.53           13.50          80.89
T2    √MSE of $\widehat{f}_N(1)$        [illegible]    [illegible]    0.75
      MISE on [0.3, 5]                  [illegible]    10.65          39.85
T3    √MSE of $\widehat{f}_N(1)$        [illegible]    [illegible]    [illegible]
      MISE on [0.3, 5]                  2.48           7.83           55.20
T4    √MSE of $\widehat{f}_N(1)$        1.03           3.34           2.44
      MISE on [0.3, 5]                  0.07           84.05          [illegible]

                                        H = 0.2        H = 0.5        H = 0.8
T1    √MSE of $\widehat{f}_N(1)$        0.35           0.37           11.79
      MISE on [0.3, 5]                  0.95           3.90           57.19
T2    √MSE of $\widehat{f}_N(1)$        0.45           [illegible]    0.29
      MISE on [0.3, 5]                  1.04           3.17           16.26
T3    √MSE of $\widehat{f}_N(1)$        47             46             95
      MISE on [0.3, 5]                  1.20           4.91           26.6
T4    √MSE of $\widehat{f}_N(1)$        0.61           0.61           1.74
      MISE on [0.3, 5]                  2.74           9.55           49.55

                                        H = 0.2        H = 0.5        H = 0.8
T1    √MSE of $\widehat{f}_N(1)$        0.36           0.30           0.40
      MISE on [0.3, 5]                  0.81           2.60           10.77
T2    √MSE of $\widehat{f}_N(1)$        0.21           0.22           0.31
      MISE on [0.3, 5]                  1.07           2.07           7.65
T3    √MSE of $\widehat{f}_N(1)$        0.34           0.26           0.48
      MISE on [0.3, 5]                  0.74           3.17           13.3
T4    √MSE of $\widehat{f}_N(1)$        0.40           0.56           2.59
      MISE on [0.3, 5]                  1.02           5.69           41.41

Table 2: Consistency of the estimator $\widehat{f}_N$ in the case of paths of a FBM observed at random times (50
independent replications are generated in each case).
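
The two criteria reported in the tables can be computed from replications as in the following Python sketch. It is only a plausible reading of the criteria, not the authors' code: the root mean square error is taken at a single frequency, the MISE is taken as the average over replications of the squared error integrated over the frequency band by the trapezoidal rule, and the target density and the estimates are synthetic placeholders.

```python
import numpy as np

def root_mse(estimates, truth):
    """Square root of the mean squared error of pointwise estimates."""
    e = np.asarray(estimates, dtype=float)
    return float(np.sqrt(np.mean((e - truth) ** 2)))

def mise(est_curves, true_curve, freqs):
    """Mean over replications of the integrated squared error on the grid freqs.

    est_curves has shape (n_replications, n_frequencies).
    """
    sq = (np.asarray(est_curves, dtype=float) - np.asarray(true_curve)) ** 2
    widths = np.diff(freqs)
    # Trapezoidal rule in frequency for each replication, then average.
    ise = np.sum(0.5 * (sq[:, 1:] + sq[:, :-1]) * widths, axis=1)
    return float(np.mean(ise))

freqs = np.linspace(0.3, 5.0, 48)                # frequency band [0.3, 5]
true_curve = freqs ** (-2.0)                     # hypothetical target density
est_curves = np.tile(true_curve + 0.1, (50, 1))  # 50 synthetic replications, offset 0.1

point_error = root_mse(est_curves[:, 0], true_curve[0])
band_error = mise(est_curves, true_curve, freqs)
```

With a constant offset of 0.1, the pointwise error equals 0.1 and the integrated error equals $0.1^2 \times 4.7 = 0.047$, which gives a quick check of the two helpers.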
                                          a = 0.1      a = 1        a = 10
T1    √MSE of $\widehat{f}_N(0.3)$        0.51         0.22         0.020
      MISE on [0.3, 5]                    0.022        0.014        0.00067
T2    √MSE of $\widehat{f}_N(0.3)$        0.30         0.30         0.021
      MISE on [0.3, 5]                    0.010        0.024        0.0010
T3    √MSE of $\widehat{f}_N(0.3)$        0.36         0.23         0.018
      MISE on [0.3, 5]                    0.00052      0.015        0.00052
T4    √MSE of $\widehat{f}_N(0.3)$        0.28         0.23         0.032
      MISE on [0.3, 5]                    0.016        0.016        0.0045

                                          a = 0.1      a = 1        a = 10
T1    √MSE of $\widehat{f}_N(0.3)$        0.20         0.18         0.017
      MISE on [0.3, 5]                    0.0033       0.0088       0.00031
T2    √MSE of $\widehat{f}_N(0.3)$        0.14         0.18         0.019
      MISE on [0.3, 5]                    0.0032       0.0092       0.00036
T3    √MSE of $\widehat{f}_N(0.3)$        0.17         0.18         0.016
      MISE on [0.3, 5]                    0.0027       0.011        0.00032
T4    √MSE of $\widehat{f}_N(0.3)$        0.18         0.13         0.024
      MISE on [0.3, 5]                    0.0058       0.0095       0.00037

                                          a = 0.1      a = 1        a = 10
T1    √MSE of $\widehat{f}_N(0.3)$        0.14         0.10         0.012
      MISE on [0.3, 5]                    0.0016       0.0045       0.00015
T2    √MSE of $\widehat{f}_N(0.3)$        0.26         0.13         0.011
      MISE on [0.3, 5]                    0.012        0.0055       0.00014
T3    √MSE of $\widehat{f}_N(0.3)$        0.18         0.14         0.012
      MISE on [0.3, 5]                    0.0023       0.0049       0.00017
T4    √MSE of $\widehat{f}_N(0.3)$        0.16         0.16         0.017
      MISE on [0.3, 5]                    0.0084       0.034        0.00019

Table 3: Consistency of $\widehat{f}_N$ in the case of paths of a stationary Ornstein-Uhlenbeck process observed at
random times (50 independent replications are generated in each case).
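
For readers who wish to reproduce an experiment of this kind, the following Python sketch simulates a stationary Ornstein-Uhlenbeck process exactly at random observation times, using its Gaussian transition law. The parametrization $dX = -\theta X\,dt + \sigma\,dW$, the exponential durations and all numerical values are assumptions made for illustration; they are not the settings behind Table 3.

```python
import numpy as np

def simulate_ou_at_random_times(n, theta, sigma, mean_gap, rng):
    """Exact simulation of a stationary OU process at n + 1 random times.

    Assumes dX = -theta * X dt + sigma dW, whose stationary variance is
    sigma^2 / (2 * theta), and i.i.d. exponential durations between observations.
    """
    gaps = rng.exponential(mean_gap, size=n)
    times = np.concatenate(([0.0], np.cumsum(gaps)))
    stat_var = sigma**2 / (2.0 * theta)
    x = np.empty(n + 1)
    x[0] = rng.normal(0.0, np.sqrt(stat_var))   # start in the stationary law
    for i in range(n):
        rho = np.exp(-theta * gaps[i])          # autocorrelation over the gap
        noise_sd = np.sqrt(stat_var * (1.0 - rho**2))
        x[i + 1] = rho * x[i] + noise_sd * rng.normal()
    return times, x

rng = np.random.default_rng(3)
times, x = simulate_ou_at_random_times(n=20_000, theta=1.0, sigma=1.0,
                                       mean_gap=1.0, rng=rng)
sample_var = float(np.var(x))
```

Because the transition over each gap is sampled from its exact Gaussian law, no discretization error is introduced by the irregular spacing, and the sample variance should be close to $\sigma^2/(2\theta) = 0.5$ here.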



